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(54) Modified phytases 

(57) The present invention is directed to a process 
for the production of a modified phytase with improved 
activity properties characterized therein tliat the follow- 
ing steps are effected: 

a) the three dimensional Structure of the phytase to 
be modified and, optionally of another phytase with 
activity properties which are more favorable than 
the ones of the phytase to be modified rs/are com- 
puter modelled on the basis of the three dimen- 
sional structure of the phytase of Aspergillus niger; 

b) the structure of the active sites of the phytase to 
be modified and of the phytase with the more favo- 
rable activity properties are compared and those 
amino acid residues in both active sites which are 
different are identified; 

c) a DNA sequence coding for a modified phytase Is 
constructed by changing the nucleotides coding for 
at least one of the amino adds kiy which kx>th active 
skies differ: 

d) integrating such a DNA sequence into a vector 



capable of expression in a suitable host cell; 

e) transfomning a suitable host cell by the DNA 
sequence of c) or the vector of d), growing said host 
cell under suitable growth conditions and isolating 
the modified phytase from the host cell or the cul- 
ture medium by methods known in the state of the 
art. 

The present invention is also directed to phytases 
obtainable or obtained by such a process. 
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Description 

[0001] Phytases (/nyo-inosHol hexakisphosphate phosphohydrolases; EC 3.1.3.8) are enzymes that hydrolyze 
phytate (myoAnosM hexakisphosphate) to /TTyo-inositd and inaganic phosphate and are known to be valuable feed 

5 additives.. 

[0002] A phytase was first descrbed in rice bran in 1907 [Suzuki et a!., Bull. Coll. Agr. Tokio Imp Univ. Z 495 (1907)] 
and phytases from Aspergillus species In 1911 [Dox and Golden, J. Biol. Chem. 1Q, 183-186 (191 1)]. Phytases have 
also been found in wheat bran, plant seeds, animal intestines and in microorganisms [Howsen and Davis. Enzyme 
Microb. Technol. & 377-382 (1983). Larr^echts et al.. Biotech. Lett. 14. 61-66 (1992), Shieh and Ware. Appt. Microbiol. 
10 IS. 1348-1351 (1968)]. 

[0003] The cloning and expression of the phytase from Aspergillus niger ^Icuum) has been described by Van Hart- 
IngsvekJt et al., in Gene, 12Z. 87-94 (1993) and in European Patent Application. Publication No. (EP) 420 358 and from 
Aspergillus niger var. awamori by Piddington et al.. in Gene 132, 55-62 (1993). 

[0004] Cloning, expression and purification of phytases with improved properties have been disclosed in EP 684 313. 
75 However, since there is a still ongoing need fa further improved phytases. especially with respect to the activity prop- 
erties, it is an object of the present invention to provkte the following: 

i) A process for the production of a modified phytase with improved activity properties characterized therein that the 
following steps are effected: 

20 

a) the three dimensional structure of the phytase to be modified and, optionally of another phytase with activity 
properties which are more favorable than the ones of the phytase to be modified is/are computer modelled on 
the basis of the three dimensional structure of the phytase of Aspergillus niger (f icuum); 

25 b) the structure of the active sites of the phytase to be modified and of the phytase with the more favoratde 

activity properties are compared and those amino ackJ residues in both active sites which are different are 
identified: 

c) a DNA sequence coding for a modified phytase is constructed txy changing the nucleotides coding for at 
30 least one of the amino acids by which both active sites differ; 

d) integrating such a DNA sequence into a vector capable of expression in a suitable host cell; 

e) transforming a suitable host cell by the DNA sequence of c) or the vector of d), growing said host cell under 
35 suitable growth conditions and isolating the modified phytase from the host cell or the culture medium by meth- 
ods known in the state of the art; or 

ii) a process as described under i) wherein the phytase to be modified is of eukaryotic, preferably fungal, more pref- 
erably Aspergillus. e.g. Aspergillus fumigatus origin; or 

40 

iii) a process as described under i) or ii) wherein the phytase with more favorable activity properties is of eukaryotic. 
preferably fungal, more preferably Aspergillus, e.g. Aspergillus niger or Aspergillus ten-eus (Aspergillus terreus cbs 
116.46 or 9A1) origin; or 

45 iv) a process as described under i). ii) or iii) wherein the phytase to be modified is a phytase of Aspergillus fumiga- 
tus and the phytase with the more favorable activity properties is the Aspergillus terreus phytase or the phytase of 
Aspergillus niger. 

[0005] In this contact it should k>e mentioned that another possibiliy of producing phytases with improved properties 
so is by isolating phytases from the same organism, like fbr example the Aspergillus f icuum. but different strains which can 
be found in nature and have been deposited by any of the known depository authorities. Their amino add sequences 
can t>e determined by cloning their corresponding DNA sequences by methods as de8crik>ed, eg. in European Patent 
Application No. (EP) 684 31 3. Once such sequences have been d^ined they can be modeled on the basis of the three- 
dimensional structure of the A. niger phytase and the active sites of both sequences can be compared to find out 
55 whether such phytase shouki have improved activity properties (see Example 8) or both active site sequences can be 
compared directly and than tested fbr increased and/or improved activity by the assays described in the present appli- 
cation. 

[0006] It Is furthermore an object of the present invention to provide a modified phytase which is obtainable by a proc- 
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ess as described above. 

[0007] It is in general an object of the present invention to provide a phytase which has been modified in a way that 
its activity property is nrtore favorak>le than the one of the non-modified phytase, specifically such a phytase character- 
ized therein that the amino add sequence of the non-modified phytase has been changed by deletion, substitution 

5 and/or addition of one or more amino adds, more specifically such a phytase wherein cimnges have been made at at 
least one position which is homobgous to one of the followir^ positions of the amino acid sequence of the phytase of 
Aspergillus (A.) niger (see Fig.1): 27. 66, 71, 103, 140, 141. 188, 205, 234, 235. 238, 274. 277. 282. 340 and^r 424. 
preferably 27. 66. 140, 205. 274. 277, 282 and/or 340. and even more ^edfically such a phytase which is the phytase 
of eukaryotic. preferably fungal, more preferably Aspergillus and most preferably Aspergillus fumigatus. origin. 

10 [C^iOB] It is furthermore an object of the pr^ent invention to provide such a phytase wherein at position 27 or at least 
at position 27 a change occurs, preferably a phytase wherein the amino add at position 27 is replaced by one selected 
from one of the following groups: 

a) Ala, Val, Leu, Ue; or 
IS b)Thror 

c) Asn; artd furthermore such a phytase wherein in addition to position 27 a change occurs also at position 66 or 
wherein in addition to position 27 a change occurs also at position 140 and/or at positions 274 and/or 277. 

[0009] It is also an ok^ect of the present invention to provide a phytase as ^ecified above which is characterized by 
20 at least one of the following mutations: Q27L, C327N, Q27T, Q27I. Q27V. Q27A. Q27G, S66D. S140Y, D141 G, A205E. 
Q274L. G277D, G277K Y282H and/or N340S. 

[001 0] It is furthermore an object of the present invention to provide phytase muteins which are resistant against deg- 
radation by proteases of fungal, preferably A^ergillus and most preferably Aspergillus niger (ftcuum) origin. Such 
muteins are characterized therein that at least at one of tiie following positions (which refers to the homologous position 

25 in tiie amino acid sequence of A. niger), namely position 130 or 129 and 130, preferably of the Aspergillus fumigatus or 
167. 168 preferably of the A. rtidulans phytase amino add sequence, tiie amino acid which is present in the wild type 
sequence has been replaced against another amino acid which Is known to change tiie protease sensitivity, e.g. in tiie 
case off A. fumigatus at position 130 from "S" to "N" and at position 129 from "R" to "L" and In case of A. nidulans at 
position 1 67 from "K" to "G" and at position 1 68 from R to Q. Such positions can be also combined witti tiiose providing 

30 for improved activity propelies. 

[0011] In this context -improved activity property" means any type of improvement of ttie activity of the mutated 
phytase as compared to the non mutated. This could mean for example a higher ^ecific activity, preferably at least two 
fold or more preferably at least 3 to 4 fold higher in an assay known in the state of the art to measure phytase activity, 
see e.g. in EP 684 3 1 3 or described in the exanptes of tiie present application. Furttiermore this couM mean a different 

35 substrate specificity determined in an assay known in the state of the art or as described e.g. in the specific exanfiples 
of ttie present invention. This oould also mean a maximum of the specific activity at a different more tavorable pH or a 
broad pH optinuim ("improved pH profile") deterntined by an assay as known in the state of the art or as described e.g. 
in the examples. Finally tiiis ooukJ also mean any conr^ination of such properties. 

[001 2] "Homologous" in the context of the present Invention means the best fit of the primary, preferably also second- 

40 ary and most preferably also tertiary structure of tiie phytase to be modified and tiie phytase of Aspergillus niger. How 
such b^tfit can be obtained is described in detail in Example 1 of the present invention. Figure 1 gives an example of 
such best fit for tiie phytase amino acid sequences of Aspergillus fumigatus and Asp&gillus terreus aligned on the basis 
of the Aspergillus niger anvno ackJ sequence which latter sequence is also used as tiie reference to which the positions 
of tiie other sequences, e.g. the ones named before, are referred to. Furthermore tiie modified Aspergillus fumigatus 

4S phytase witti the Q27L mutation, means notiiing else than the phytase of A^rgillus funrvgatus wherein at position 27 
according to ttie assignment as defined above (which is in fact position 23 of ttie A^ergillus fumigatus amino ackJ 
sequence) the naturally occurring glutamine ("Q" refers to ttie standard URAC one fetter amino acid code) has been 
replaced by leucine ("L"). All muteins of ttie present invention are designated in ttiis way independent from wether tt)ey 
are protease resistant muteins or muteins with improved activity properties. 

50 [001 3] It Is furthermore an object of the present invention to provkie a DNA-sequence comprising a ONA sequence 
coding for a phytase as desaibed above, a vector, preferably an expression vector, comprising such a DNA sequence, 
a host cell which has bean transformed by such a DNA sequence or vector, a process for the preparation of a pt^tase 
of tiie present invention wherein tfie host cell as described betore is cultured under suitable culture conditions and tiie 
phytase is isolated from such host cell or the culture medium by metticds known in the art, and a food or feed compo- 

55 sition comprising a phytase of the present invention. 

[0014] In tills context it should be noted that it is also an object of the present invention to provide a DNA sequence 
which codes for a phytase can-ylng at least one of the specific mutations of the present invention and which hybridizes 
under standard conditions witti the DNA sequences of tiie specific modified phytases of the present invention or a DNA 



3 



EP0897010A2 



sequence which, because of the degeneracy of the genetic code does not hybridize but which codes for a polypeptide 
with exactly the same amino acid sequence as the one encoded by the DNA sequence to which it does not hytwidize 
or a DNA sequence which is a fragment of such DNA sequences which maintains the activity properties of the polypep- 
tide of which it is a fragment. 

5 [001 5] "Standard conditions*' for hybridization mean in the context the conditions which are generally used by a man 
sidlled in the art to detect specific hybridization signals and which are described, e.g. by Saniirook et el.. "Molecular 
Cloning", second edition. Cold Spring Harbor Laboratory Press 1989, New Yak, or preferably so called stringent hybrid- 
ization and non-stringent washing conditions or more preferably so called stringent hybridization and stringent washing 
conditions a man skilled in the art is familiar with and which are described, e.g. in Sambrook et al. (s.a.). 

10 [0016] It is furthermore an object of the present inverrtion to provkie a DNA sequence which can be obtained by the 
so called polymerase chain reaction method ("PCR") by PCR primers designed on the basis of the specrficafly 
described DNA sequences of the present invention. It is understood that the so obtained DNA sequences code for 
phytases with at least the same mutation as the ones from which th^ are designed and show conparable activity prop- 
erties. 

IS [0017] The principles of the polymerase chain reaction (PCR) method are outlined e.g. by White et al.. Trends in 
Genetics, S. 185-189 (1989). whereas inproved methods are described e.g. in innis et al. [PCR Protocols: A guide to 
Methods and Applications, Academic Press, Inc. (1990)]. 

[0018] DNA sequences of the present invention can be constructed starting from genomic or cDNA sequences coding 
for phytases known in the state of the art [for sequence information see references mentioned above, e.g. EP 684 313 

so or sequence data bases, for example like Genk)ank (Intelllgenetics, Calrfbrnia. USA). European Bioinfbrmatics Institute 
(Hinston Hall, CambrkJge, GB). NBRF (Georgetown University. Medtoal Centre. Washington DC, USA) and Vecbase 
(University of Wisconsin. Biotechnology Centre, Madison. Wisconsin. USA) or disclosed in the figures by methods of in 
vitro mutagenesis [see e.g. Sambrook et al.. Molecular Qoning. CoM Spring Hartx)r Laboratory Press. New \brk]. A 
wkJely used strategy for such "site directed mutagenesis", as originally outlined by Hurchinson and Edgell [J. Virol, g, 

25 181 (1971)]. involves the annealing of a synthetic oligonucleotide carrying the desired nucleotide substitution to a target 
region of a single-strarxled DNA sequence wherein the mutation should be introduced [for review see Smith. Annu. Rev. 
Genet. IS. 423 (1985) and for inproved methods see references 2-6 in Stanssen et al., Nucl. Add Res., 17. 4441-4454 
(1989)]. Another possibility of mutating a given DNA sequence which is also preferred for the practice of the present 
invention is the mutagenesis by using the polymerase chain reaction (PCR). DNA as starting material can be isolated 

30 tiy methods known in the art and described e.g. in Sambrook et al. (Molecular Cloning) from the respective strains. For 
strain information see. e.g. EP 684 313 or any depository authority indtoated below. Aspergillus niger [ATCG 9142], 
Mycellophthora thermophila [ATCC 481 02]. "Rdaromyces thermophllus [ATCC 20186] and Aspergillus fumigatus [ATCG 
346251 have been redeposited on March 14. 1997 according to the condittons of the Budapest Treaty at the American 
Type Culture Cell Collection under the following accession numbers: ATCC 74337, ATCC 74340, ATCC 74338 and 

35 ATCC 74339. respectively. It is however, understood that DNA encoding a phytase to be mutated in accordance with 
the present invention can also be prepared on the basis of a known DNA sequence. e.g. as shown in Fig. 6 In a syn- 
thetic manner and described e.g. in EP 747 483 k>y methods known In the art. 

[0019] Once complete DNA sequences of the present invention have been obtained they can be integrated into vec- 
tors by methods known In the art and described e.g. In Sambrook et al. (s.a.) to overexpress the encoded polypeptide 

40 in appropriate host systema However, a man skilled in the art knows that also the DNA sequences themse^es can be 
used to transform the suitable host systems of the invention to get overexpression of the encoded polypeptide. Appro- 
priate host systems are for example fungi, like Aspergilli, e.g. Aspergillus niger [ATCC 9142] or A^ergillus ftouum 
[NRRL 3135] or like Trichodenna, e.g. Trichoderma reesel or yeasts, like Saocharomyces. e.g. Saccharomyces cerevi- 
siae or Pichia. like Pichia pastoris. or Hansenula polymorpha, e.g. H. pdymorpha (DSM5215). A man skilled in the art 

46 knows that such microorganisms are available from depository authorities, e.g. the American Type Culture Collection 
(ATCC). the Centraalbureau voor Schimmek;ulture5 (CBS) or the Deutsche Sammlung fOr Mikroorganismen und Zel- 
IkuHuren QmbH (DSM) or any other depository authority as listed in the Journal "Industrial Property" [(1991) 1, pages 
29^]. Bacteria which can be used are e.g. E. coil. Bacilli as, e.g. Bacillus subtilis or Streptomyces. e.g. Streptomyces 
Irvklans (see e.g. Ann6 and Mallaert in FEMS Microbiol. Letters IHi 121 (1993). E. coii. which could be used are E. 

BO coll K12 strains e.g. M15 [descrbed as DZ 291 by Villarejo et al. In J. Bacterid, m 466-474 (1974)], HB 101 [ATCC 
No. 33694] or E. ooii SQ13009 [pottesnnan et al., J. Bacteriol. 148. 265-273 (1981)]. 

[0020] Vectors wNch can t>e used for expressbn in fungi are known in ttie art and described e.g. in EP 420 358, or 
by Cullen et al. [Bio/Technology St, 369-376 (1 987)] or Ward In Molecular Industrial Mycology. Systems and Applications 
for Filamentous Fungi, Marcel Dekker, New York (1991). Upshall et al. [Bio/Technotogy S. 1301-1304 (1987)] Gwynne 
55 et al. [Bio/Technology 5. 71-79 (1987)]. Punt et al. [J. Biotechnol. U. 19^4 (1991)] and for yeast by Sreekrishna et al. 
[J. Basic Microbiol. 2S. 265-278 (1988), Biochemistry gg. 4117-4125 (1989)], HItzemann et al. [Nature 717-722 
(1981)] or in EP 183 070. EP 183 071 . EP 248 227, EP 263 31 1 . Suitable vectors which can be used for expression in 
E. coll are mentioned, ag. by Sambrook et al. [s.a.] or by Fiers et al. in Prood. 8th Int. Biotechnok)gy Symposium" [Soc. 
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Franc, de Microbiol.. Paris (Durand et al., eds.). pp. 680-697 (1988)] or by Bujard et al. in Methods In Enzymotogy, eds. 
Wu and Grossmann. Academic Press, Inc. Vol. 41 6433 (1987) and StOber et ai. in Immunological Methods, eds. 
LefkovHs and Pernis, Acadenrtic Press. Inc., Vol. IV, 121-152 (1990). Vectors which oould be used lor eKpression in 
Bacilli are Knowvn in the art and described, e.g. in EP 405 370, Procd. iSlatl. Acad. Sd. USA 81 439 (1984) by Yanajra 

5 and Henner. IVIeth. Enzymol. Ifig. 199-228 (1990) or EP 207 459. Vectors which can be used for the expression in H. 
Polymorpha are known in the art and described, e.g. in Qellissen et al., Biotechnology g. 291-295 (1991). 
[0021] Either such vectors already carry regulatory elements, ag. promotors. or the DNA secj^ences of the present 
invention can be engineered to contain such elements. Suitable promoter elements which can be used are known in the 
art and are. e.g. for Trichoderma reesei the cbhl - [Haarki et a!.. Biotechnology Z 598-600 (1989)] or the pld 1 -promotor 

10 [Schindler et al.. Gene 271-275 (1993)], for A^ergillus oryzae the amyiDromotor [ChristerBen et aL, Abstr. 19th 
Liinter^ Lectures on Molecular Genetics F23 (1987). Christensen et al.. Biotechnology 6. 1419-1422 (1988), Tada et 
al., Mol. Gen. Genet. 222. 301 (1991)], for A^ergillus niger the glaA- [Cullen et al.. BionTechnology 5, 369-376 (1987). 
Qwynne et al., BioHechnology 5, 713-719 (1 987), Ward in Molecular Industrial Mycology, Systems and Applications for 
RIamentous Fungi, Marcel DeWcer, New YorK 83-106 (1991)], aIcA- [Gwynne et al., BioAiK^inology 5. 718-719 (1987)]. 

ts suci- [Boddy et al.. Cun. Genet. 24. 60-66 (1993)], aphA- [MacRae et al.. Gene 21 339-348 (1988), MacRae et al.. 
Genem 193-198(1993)]. 1piA-[McKnightetal.,Cell^ 143-147(1986), Upshatletal., Bio/Technology S. 1301-1304 
(1987)]. gpdA- [Punt et al.. Gene gl 49-57 (1988), Punt et al., J. Biotechnol. IZ 19-37 (1991)] and the pWA-promotor 
[de Graaff et al., Curr. Genet. 21-27 (1992)]. Suitable promoter elements which oould be used for expression in 
yeast are known in the art and are. e.g. the pho5-promotor [Vogel et al.. Md. Cell. Biol.. 2050-2057 (1989): Rudolf and 

20 Hinnen. Proc. Natl. Acad. Sd. 84. 1340-1344 (1987)] or the gap-promotor for expression in Saccharomyces cerevisiae 
and for Pichia pastoris. e.g. the aoxl-promotor [Koutz et aL. Yeast §, 167-177 (1989); Sreelaishna et al., J. Basic Micro- 
biol. 2S. 265-278 (1988)], or the FMD promoter [Hdlenberg et al., EPA Na 0299108] or MOX-promotor [Ledeboer et al.. 
Nucleic Adds Res. 13, 3063-3082 (1985)] for H. polymorpha. 

[0022] Accordingly vectcM's comprising DNA sequences of the present invention, preferably for the expression of said 
25 DNA sequences in bacteria or a fungal or a yeast host and such transfbrmed bacteria or fungal or yeast hosts are also 
an object of the present invention. 

[0023] Once such DNA sequences have been expressed in an appropriate host cell in a suitable medium the encoded 
phytase can be isolated either from the medium in the case the phytase is secreted into the medium or from the host 
organism in case such phytase is present intracellularly by methods known in the art of protein purification or described. 
30 e.g. in EP 420 358. Accordingly a process for the preparation of a polypeptide of the present invention characterized in 
that transfbrmed bacteria or a host cell as described atx>ve is cultured under suitable culture conditions arid the 
polypeptide is recovered therefrom and a polypeptide when produced by such a process or a polypeptide encoded by 
a DNA sequence of the present invention are also an abject of the present invention. 

[0024] Phytase of the present invention can be also expressed in plants according to methods as described. e.g. by 

35 Pen et al. in BioHechnology U. 81 1-614 (1994) or in EP 449 375, preferably in seeds as described. e.g. in EP 449 376. 
[0025] For example, a DNA sequence encoding a phytase of the present invention can be placed under the control 
of regulatory sequences from the gene encoding the 128 storage protein cruciferin from Brassica napus. Ibe construct 
is thereafter subdoned into a binary vector such as PMOG23 (In E. coli K-12 strain DHSo. deposited at the Centraal 
Burrau voor ScWmmelcultures. Baarn, Ttie Netherlands under accession number CBS 102.90). This vector is intro- 

40 duced into Agrobacterium tumefadere which contains a disarmed Ti plasmid. Bacterial cells containing this contruct 
are co-cultivated with tissues from tobacco or Brassica plants, and transformed plant cells are selected by nutrient 
media containing antibiotics and induced to regenerate into differentierted plants on such media. The resulting plants 
will produce seeds that contain and express the DNA contruct Or the phytase-encoding DNA sequence can be placed 
under the control of regulatory sequences from the 35S promoter of Cauliflower Mosaic Virus (CaMV). The oontruct is 

46 thereafter subdoned into a binary vector. This vector is then introduced into Agrobacterium tumefadens which contains 
a disarmed Ti plasmid. Bacterial cells containing this construct are cocultivated with tissues from tobacco or Brassica 
plants, and transformed plant cells are selected by nutrient media containing antibiotics and induced to regenerate into 
differentiated plants on such media. The resulting plants contain and express the DNA construct constitutively. 
[0026] The plant or plant part containing phytase can be used directiy for the preparation of a feed composition or can 

so be extracted from plants or plant organs kiy methods known in the art. Accordingly it is also an object of the present 
invention to provide a process tor the production of the phytases of the present invention in plants or plant organs. Iil(e 
seeds, the phytases when produced fesy such methods, the transfbrmed plants and pbnt organs, like seeds itself. 
[0027] Once obtained the polypeptides of the present invention can be characterized regarding tiieir properties which 
make them useful in agriculture any assay known in the art and described e.g. by Simons et al. [Br. J. Nutr. £4, 525-540 

55 (1990)]. SchOner et al. [J. Anim. Physiol, a. Anim. Nutr. gg. 248-255 (1991)]. Vogt [Arch. QeflQgelk. SS. 93-98 (1992)]. 
JongWoed et al. [J. Anim. Sd., 20. 1159-1168 (1992)], Perney et al. [Poultry Sd. 32. 2106-21 14 (1993)], Farrell et al.. 
[J. Anim. Physiol, a Anim. Nutr. fig. 278-283 (1993), Broz et al., [Br. Poultry Sd. 3& 273-280 (1994)1 and DQngelhoeff 
et al. [Animal Feed Sci. Technol. 49. M 0 (1 994)] can be used. 
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[0028] In general the polypeptides of the present invention can be used without being limited to a specific field of appli- 
cation for the conversion of inosHot polyphosphates, like phytate to inositol and inorganic phosphata 
[0029] Furthermore the polypeptides of the present invention can be used in a process for the preparation of com- 
pound food or feeds wherein the components of such a composition are mixed with one or more polypeptides of the 
present invention. Accordingly compound food or feeds comprising one or more polypeptides of the pxesent invention 
are also an object of the present invention. A man skitled in the art is familiar with their process of preparation. Such 
compound foods or l^eds can further comprise add^ives or components generally used Ibr such purpose and Known in 
the state of the art 

[0030] It is furthermore an object of the present invention to provide a process for the reduction of levels of phytate in 
animal manure characterized in that an animal is fed such a feed composition in an amount effective in converting 
phytate contained in the feedstuff to inositol and inorganic phosphate. 

[0031] Before describing the present invention in more detail a short explanation of the enclosed Figures is given 
below. 

[0032] 

Figure 1 : Primary sequence alignment of A niger (ficuuml A, terreus cbsl 16.46 and A fumigatus [ATCC 13073] 
phytase. Stars show Identical residues within the active site and rectangles, non-identical residues 
within the active site. 

Fi gure 2 : pH optima cun^es. Specific activify of wild-type and mutant A fumigatus phytases is plotted against pH 
of incubation. Filled squares represent A, fumigatus wild-type phytase; Open triangles represent A 
fumigatus Q27L mutant Filled circles represent A fumigatus Q27L, Q274L mutant; Open squares rep- 
resent A fumigatus Q27U Q274L 0277D mutant. 

Fi gure 3 : Substrate specificities of wild-type and mutant A fumigatus phytases. (A) wild-type; (B) Q27L single 
mutant; (C) Q27L, Q274U Q277D triple mutant The following substrates were used: (1) phytic acid; (2) 
p-nitrophenyl phosphate; (3) fructose-1.6-blsphosphate; (4) fructose-6-phosphate; (5) glucose-6-phos- 
phate; (6) ribose-S-phosphate; (7) a-glycerophosphate; (8) ^-glycerophosphate; (9) 3-phosphoglycer- 
ate; (10) phosphoenotpyruvate; (1 1) AMP; (12) ADP; (13) ATP. 

Fi gure 4 : Complete coding sequence and encoded amino acid sequence of the Aspergillus nidulans phytase. 

Figure 5 : Complete coding sequence and encoded amino add sequence of Talaromyces thermcphilus phytasa 

Fi pure 6 : Complete coding sequence and encoded amino add sequence of Aspergillus fumigatus [ATCC 13073] 
phytase. 

Figure 7 : Complete coding sequence and encoded amino add sequence of Aspergillus terreus CBS 116.46 
phytase. 

Fi gure 8 : Crystallographic data of the structure of the Aspergillus niger phytase. 

Figure 9 : Sibstrate specif idties of wild-type and mutant A. fumigatus phytase (N1 -N6). Substrates 1 to 1 3 are as 
indicated for Figure 3. 

Figure 10 : pH optima curves of further mutant A. fumigatus phytases (N1-N6). All activity values were standardized 
(maximum activity a i.o). 

Figigalla : Stereo picture of the threMlimensional fold of A niger (A. ficuum; NRRL 3135) phytase. The active site 
is Micated with a drde and the catalytiGally essential amino acid residues Arg 58 and His 59 are shown 
in ball-arid-stiGk representation. TTis figure was prepared with the programs "MOLSCRIPT" [Kraulis. 
P.J.. J. Appl. Cryst. 24, 946-950 (1991)] and "RASTER3D" [Merritt. E.A & Murphy, M.E.R. Acta CrysL, 
869-873 (1994)]. 

Figure lib : Topological sketch, using the same scheme as in (a). The five disulphtde bridges are shown as black 
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zigzag lines together with the sequence numbers of the cysteine residues involved. The p-strands are 
defined with the sequence numk)er8 A: 48-58. B: 134-138. C: 173-177. D: 332-337. E: 383-391. and F: 
398-403. The cE-helices are defined with the sequence numbers a: 66-82, b: 88-95, c: 107-123. d: 141- 
159, e: 193-197, f: 200-210. g: 213-223, h: 231-246. 1: 257-261. j: 264-281 , k: 290-305, 1: 339-348. m: 
423-429. and n: 439-443. The asterisk at the C-terminal end of p-strand A marks the location of the cat- 
alyticaliy essential amino ackJ residues Arg 58 and His 59. 



Figure 12 : Stereo picture of the active site of A. f icuum (ATCC 1 3073) phytase with a hypothetical binding mode of 
the substrate phytate. In this nxxiel, the bound crystal water molecules were removed and the protein 
atom positions were held fixed, except for small adaptations off the side chain torsion angles of Lys 68 
In order to interact with the substrate. All the conserved amino add residues Arg 58. His 59. Arg 62. Arg 
142. His 338 and Asp 339 form hydrogen bonds to the scissile 3-phosphate group of phytate. as indi- 
cated with lines of small dots. His 59 is in a favorable position to make a nudeophilic attack at the scis- 
sile phosphorous, indicated with a Una of larger dots, and Asp 339 is in a position to protonate the 
leaving group. 

Rpure 13 : Constnjction of the basic plasmids pUC18-AfumgDNA and pUCia-AfumcDNA for site directed muta- 
genesis. 

Fi gure 14a : Primer sets A-N used for site directed mutagenesis. 
Figure 14b : Primer sets 0-T used for site directed mutagenesis. 
Fi gure 15 : Construction of plasmids pgDNATI -pgONAT7. 
Figure 16 : Construction of plasmids pgDNANI -pgDNAN6. 
Fi gure 17a : Construction of plasmids pcTI -pcT7. 

Figure 1 7b : Constmction of plasmids pcTI-Avrll, pcT1-S66D and pcT1-S140Y-D141G 
Figure 17c : Construction of plasmids pcDNA-N27. -T27. -127. -V27, -A27, -027. 
Figure 18 : Construction of plasmids pcNI - pcN6. 

R gure 19 : Plasmkl pAfum-TI tor the e)q}ression of mutein T1 in Aspergillus niger. 

Rgure 20: pH optima curves. Specific activity of wlkl-type and mutant A. fumigatus phytases is plotted against pH 
off incubation. Open triangles: A. fumigatus [ATCC 13073] wikJ-type phytase; Open rhombs: A 
fumigatus Q27Q phytase; Filled squares: A, fumigatus Q27N phytase: Filled triangles: A. fumigatus 
Q27V phytase; Open squares: A fumigatus Q27A phytase; Riled drcles: A. fumigatus 0271 phytase; 
Open drcles: A fumigatus Q27T phytase; Dashed line: A fumigatus Q27L phytase. 

Rgure 21: Substrate specificities of wild-type and mutant A. fumigatus [ATCC 13073] phytases. The used sub- 
strates 1 -1 3 are the same as mentioned In Rgure 3. The speciffic activities of the different phytases with 
any one of the 13 substrates tested are given in the foltowing order (from left to right): A fumigatus wild- 
type phytase, A fumigatus Q27N phytase. A. fumigatus Q27T phytase, A fumigatus Q27L phytase, A 
fumigatus 0271 phytase. A, fumigatus Q27V phytase, A. fumigatus Q27A phytase. A. fumigatus Q27G 
phytase. 

Rgure 22 : pH optima curves. Spedf ic activity of wild-type and mutant A fumigatus [ATCC 1 3073] phytases is plot- 
ted against pH of incubation. Riled rhombs: A fumigatus wild-type phytase; RIed squares: A 
fumigatus 027L single mutant; Open drdes: A fumigatus Q27L-S66D double mutant; Riled triangles: 
A fumigatus Q27L-S140Y-D141Q triple mutant 



iQure 23: Natural variation of phytases in different isolates of A fumigatus [ATCC 13073]. The predicted protein 
sequences are shewn and compared to that of the phytase from A. fumigatus strain ATCC 1 3073. Only 
the amino acids which differ from those in #13073 are shown. 
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Figure 24 : pH dependent specrfic activity of phytases isolated from two different A. fumigatus wildtype strains. 
Open squares: wild-type strain ATGC 13073; Riled circles: strain ATCC 32239. 

Fir qurflgS : Substrate specificities of phytases Isolated from two different A fumigatus wildtype strains. Black bars: 
wild-type strain ATCC 13073; White bats: strain ATCC 32239. 

Flr9ure26 : Construction of ptasmlds pc-S130N. pc-R129L-S130N. pc-Kl67G-R168Q. 

Examples 

HomoloGv Modeling of A. fumigatus and A. terreus cfasi 16.46 phvtase 

[0033] The amino add sequences of A fumigatus [ATCC 13073] (see Rgure 1) and A. terreus cbsl 16.46 phytase 
(see Figure 1) were compared with the sequence of A niger (ficuum) phytase (see Figure 1) for which the three-dimen- 
sional structure had been determined by X-ray crystallography. Crystallographic data are given in Figure 8. 
[0034] A multiple amino acid sequence alignment of A, niger (ficuum) phytase, A. fumigatus phytase and A. terreus 
cbsl 16.46 phytase was calculated with the program TILEUP" (Prog. Menu for the Wisconsin Package, version 8, Sep- 
tember 1994, Genetics Computer Group, 575 Science Drive. Madison Wiscondn, USA 5371 1). The three-dimensional 
models of A. fumigatus phytase and A. terreus cbsl 16.46 phytase were built by using the structure of A. n^er (ficuum) 
phytase as template and ^changing the amino acids of A. niger (ficuum) phytase according to the sequence alignment 
to amino adds of A. fumigatus and A. terreus cbsl 16.46 phytases. respectively. Model construction and energy optimi- 
zation were performed by using the program Moloc (Gerber and MQIIer, 1995). C-alpha positions were kept fixed except 
for new insertionsAleletions and in loop regions distant from the active site. 

[0035] Only small differences of the modelled structures to the original crystal structure could be observed in exterr^l 
loops. Furthermore the different substrate molecules that mainly occur on the degradation pathway of phytic acid {myo- 
inositol-hexakispho^hate) by Pseudomonas $p. tfacterium phytase and, as far as determined, k>y A. niger (ficuum) 
phytase (Cosgrove. 1 980; Rg. 1 ) were constructed and forged into the active site cavity of each phytase structure. Each 
of these substrates was oriented in a hypothetical binding mode proposed for histidine acid phosphatases (VAan Etten, 
1 982). The sdssile phosphate group was oriented towards the catalytically essential His 59 to form the covalent phos- 
phoenzyme intermediate. Ihe oxygen of the substrate phosphoester bond which will be protonated by Asp 339 after 
cleavage was orientated towards the proton donor. Conformational relaxation of the remaining structural part of the 
sut>strates as well as the sunrounding active site residues was performed by energy optimization with the program 
Moloc. 

[0036] Based on the structure models the residues pointing into the active site cavity were identified. More than half 
(60%) of these posHbns were identical between these three phytases, whereas only few positions were not conserved 
(see Figure 1). This observation could be extended to four additional phytase sequences (A nidulans, A tenreus 9A1 , 
Talaromyces thermophilus, Myceliophthora thermophila). 

[0037] The results coming from sequence alignment and structural information including fevouraUe enzyme-subslrate 
interactions were combined to define the positions for mutattonal analysis which are shown In Table 1 . 

References: 

[0038] 

Geri3er, P. and MQOer, K. (1995) Moloc molecular modeling software. J. Comput Aided Mol. Des. 2, 251-268 

Van Etten. R.L (1982) Human prostatic acid phosphatase: a histkline phosphatase. Ann. NY Acad. Sci. 390. 27-50 

Cosgrove. D.J. (1980) Inositol phosphates - their chemistry, biochemistry and physiology: studies in organic chem- 
istry, chapter 4. Elsevier Scientific Publishing Company, Anr«terdam, Oxford. New York. 

Example 2 

Construction of plasmids pUC18-AfumqDNA and pUC18-AfumcDNA 

[0039] Plasmids pUC18-AfumgDNA and pUC18-AfumcDNA the basic constructs for all the A. fumigatus muteins 



8 



EPO 897010 A2 

described below were constructed as follows. 

[0040] pUCIS-AfumgDNA: The genomic DNA sequence of the phytase gene of Aspergillus fumigatus was obtained 
by PGR using the "Expand^ High Fidelity PGR Kit' (Boehringer Mannheim. Mannheim, Germany) with primers #39 and 
#40 (designed on the basis of the genomic sequence shown in Rgure 6) and genomic DNA of Aspergillus fumigatus 
5 [ATGG 13073] from the A. fumigatus (NIH stock 5233) genomic library in a Lambda Fixll vector [Stratagene. Lugolla, 
GA 92037. USA; catalog No. 946055]. 



Primer #39: 
10 BspHI 

5^ TAT ATC ATC AT T ACT CTG ACT TTC CTG CTT TCG 3' 

MI TLTFLLS 

Primer #40: 

EcoRV 



3^ CCT CTC ACG AAA TC A ACT CTA TAG ATA TAT 5^ 
G E C F S * 



20 



[0041] The reaction mix included 10 pmol of each primer and 200 ng of template DNA. 35 rounds of amplification 
were done with the following cycling values: 95 ^G, 1 min/56 ^G, 1 mln/72 <'C, 90 sec. The PGR-amplified Aspergillus 
fumigatus mutein genes had a new BspHI site at the ATO start codon, introduced with primer #39, which resulted in the 
2S change of the second amino acid from a valine to an isoleuclna Furthermore, an EooRV site was created with primer 
#40 downstream of the TQA termination codon of the gene. 

[0042] The PGR fragment (approx. 1450 bp) was subsequently cloned into the Smal site of pUCl8 using the "sure 
clone Kit" (Boehringer Mannheim s.a.) according to the supplier'6 recommendations. The resulting plasmid was named 
pUGIS-AfumgDNA 

30 [0043] pUC18-AfumcDNA: This plasmid lacks the intron (small gap letters in Figure 6} of the A fumigatus phytase 
gene and was constructed as outlined in Rgure 13. Briefly, using primers Fum28 and Fumi 1 the 5' end of axon 2 was 
amplified by PGR (see below), digested with Ncol and EagI (new restriction site introduced with primer Fum28) and 
ligated together with the linker coding for exon 1 made of primers Fum26 and Fum27 into the Xbal and Ncol sites of 
pUGIS-AfumgDNA, tiiereby resulting In plasmid pUCIS-AfumcDNA 

95 



40 

4 



45 



SO 



55 
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Fum28: 

5' ATATA TCGGCCGA GTGTCTGCGGCACCTAGT 3' 

EagI 

Fumll: 

5' TGAGGTCATCCGCACCCAGAG 3' 
Fum26: 

5' CTAGAATTCATGGTGACTCTGACTTTCCTGCTTTCGGCGGCGTATCT 
GCTTTCC 3* 

Fum27: 

5' GGCCGGAAAGCAGATACGCCGCCGAAAGCAGGAAAGTCAGAGTC 
ACCATGAATT 3' 

PGR reaction to get 5' end of exon 2 of the A. fumigatus phytaee: 

2 ^1 template: pUC18-AfumgDNA (20 ng) 

1 ^1 dNTP*8-mix (Boehringer Mannheim s.a.) 
5\i\ 1 0x Buffer 

1 ^1 Taq polymerase (Boehringer Mannheim &a.) 
1.9 III Fum11 (=10pmol) 
2^1 Fum28(Bl0pmol) 
37,1 fit HgO 

[0044] In total 35 cycles with the temperature prcffle: 95^*0 for 30 8ec/56*'C for 30 sec/ 72<'C for 45 sec were made. 
The ampirfied fragment (approx. 330 bp) was extracted once with an equal volume of phenol/chloroform (1 To the 
recovered aqueous phase 0. 1 volume of 3 M sodium acetate. pH 4.8 and 2.5 volumes of ethanol were added. The mix- 
ture was centrifuged for 10 min at 12000 g and the pellet resusp^ded in 20 |il of H2O. Subsequently, the purified frag- 
ment was digested with Ncol and EagI and processed as outlined above. 

'I 

Example 3 

ConstrucHon of muteins of the phvtase of Aspemillus fumioatus for exoressjon in A. niaer 

[0045] To construct all muteins for the expression in A. niger. plasmtd pUC18-AfumgDNA was used as template for 
site-directed mutagenesis. Mutations were introduced using the "quick exchange^ site-directed mutagenesis kit" from 
Stratagene (La Jolla, CA. USA) following the manufacturer's protocol and using the conesponding primers (Figure 14). 
All mutations made are summarized in Table 1 A and B wherein T1 to T7 and N1 to N6, resi^ectively, refer to the muteins 
and "Mutatton" to the amino ackis replaced at such position. For example T5 refers to a mutein with a double mutation: 
L at position 27 for Q and L at position 274 for Q. The primer sets (A-H) used to introduce the con-esponding mutations 
are shown in Rgure 14a. The newly introduced amino aad is shown in bold and the subsaipt indicates the position in 
the mature Aspergillus fumigatus enzyme ooncenning to the numbering of the A. niger amino acid sequenoa Figures 
15 and 16 outline the scheme for the construction off different plasniids pgT1-pgT7 and pgN1-pgN6 encoding the 
muteins carrying only one mutation (T1-T4; N1-N3) or more mutatbns (T5-T7; N4-N6). Clones harboring the desired 
mutations were identified by DNA sequence analysis as known in the art. The mutated phytases were verified com- 
plete sequencing of the genes. 



10 



EP 0 897 010 A2 



Example 4 

Construction of miiteins of the ohvtase of AsoerQlllus fumiqatus for expression in Saccharomvces cerevisiae 

5 [0046] Construction of plasmids pcTI - pcT7 (Figure 17a) and pcN1 - pcN6 (Figure 18), respectively, encoding the 
muteins T1-T7 and N1-N6 for the expression in S. cerevisiae was basically done as outlined in Example 3. Instead of 
using pUCi 8-AfumgDNA as the basic construct to Introduce the mutations, plasmid pUCl8-Af umcDNA was used (Rg- 
ure 13). 

[0047] The plasmids pcDNA-N27, -G27, -V27. -A27, -127 and -T27 encoding the muteins N27. G27, V27, A27, 127 

10 and T27 were constructed as follows: 

[0048] A silent restriction site for Avrll was introduced into plasmid pcTI ty site directed mutagenesis as described in 
Example 3 using primer set I (Figure 14a; Figure 17b). The A. fumigatus phytase gene fragment Avrll/Xhol was then 
replaced by the linker fragment harbouring the desired mutations (Figure 1 7c). Each linker fragment was generated by 
annealing of the respective pairs of synthesized polynucleotides (Fig. 14b; sense and antisense strand; 90 ng each) for 

15 3 min at 70 ""C in 9 fil distilled water. 

[0049] Construction of plasmids pcT1-S66D and pcT1-S140Y-D141G encoding the A. fumigatus Q27L-S66D double 
mutant and the A fumigatus Q27L-S140Y-D141Q triple mutant was basically canried out as described in Example 3. 
Plasmid pcTI. harbouring the mutation coding for Q27L, was used as template for site directed mutagenesis together 
with the conesponding primer sets J and K (Rgure 14a; Rgure 1 7b). 

20 [0050] All mutations were verified by DNA sequence analysis of the entire gene. 

Example 5 

Expression in Aspergillus nig er 

25 

[0051] The genes encoding the aforementioned A. fumigatus wild-type phytase and muteins (Fig. 16) were isolated 
with BspHI and EcoRV from plasmids pgDNAT1-pgDNAT7 and pgDNAN1-pgDNAN6 and ligated into the Ncol site 
downstream of the glucoamylase promoter of Aspergillus rnger (glaA) and the EcoRV site upstream of the Aspergillus 
nidulans tryptophan C terminator (f/pC) (Mullaney et at., 1985). The resulting expression plasmids had In addition the 

30 orotidine-S'-phosphate decart>axylase gene {pyrA) of Neurospora crassa as selection marker. Rgure 19 shows an 
example for such an expression plasnad carrying the gene encoding mutein T1 (van den Hondel et ah. 1 991 ). The basic 
expression plasmid desaibed above corresponds basically to the pGLAC vector desaibed In example 9 of EP 684 31 3. 
Transformation of Aspergillus niger arvJ expression of the muteins was done as described in EP 684 313. 
[0052] The supernatant was concentrated by way of ultrafiltration in Amicon 8400 cells (PM30 membranes) and ultra- 

35 free-1 5 centrifugal filter devices (Biomax-30K. Millipore). 

[0053] TTie concentrate (typically 1 .5-5 ml) was desalted In aliquots of 1 .5 ml on a Fast Desalting HR 10/10 column 
(Pharmacia Biotech), with 10 mM sodium acetate, pH 5.0. sennng as elution buffer. The desalted A. fumigatus samples 
were directly loaded onto a 1.7 ml Poroe HS/M cation exchange chromatography column (PerSeptive Biosyslems, 
Ramingham. MA, USA). A. terreus cbsl 16.46 [CBS 220.95] phytase was directly loaded onto a 1.7 ml Poros HQ/M 

40 anion exchange chromatography cdumn. In both cases, phytase was eluted in pure form by way of a sodium chloride 
gradient. 

References: 

46 [0054] 

Mullaney, E. J.. J. E. Hamer. K. A. Roberti, M. M. Yelton, and W. E. Timbertake. 1985. Primary structure of the trpC 
QBm from Aspergilbs nidulans. Mol. Gen. Genet. 19937-45. 

Van den Hondel, C. A. M. J. J.. P. J. Punt, and R. F. M. van Gorcom. 1991 . Heterologous gene expression in fila- 
60 mentous fungi. In: More gene manipulations in fungi, pp. 396-428. Bennett. J. W. and Lasure, L L (eds.). Aca- 
demic Press Inc.. San Diego. CA. 

Example 6 

55 Expression In Saccfiaromvcss CGfevisias 

[OOSq The Intron less genes encoding the A fiz/n/j^aft/s wild-type phytase and the different muteins (Fig. 17/18) men- 
tioned above were isolated from the respective plasmids pUCl8-AfumcDNA, pcDNATI - pcDNAT7 and pcDNANI - 
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PCDNAN6 wrth EooRI and EcoRV and subcbned either between the blunt ended Xhoi and the EcoRI sites of plasmid 
pYES2 (InvTtrogen. San Diego. CA. USA) or the shortened QAPFL (glyceraldehyde-3-phosphat6 dehydrogenase) pro- 
nnoter and the PH05 tenminator as described by Janes et al. (1990). Transformation of Saccharomyces cerevisiae 
strains, e.g. INVScI (Irn^og^. San Diego, CA, USA) wasdoneacoordingtoHlnnenetal. (1978). Single colonies har- 
bouring the phytase gene under the control of the QAPFL pronx)ter were picked and cultivated in 5ml selection medium 
(SD -uracil) (Sherman et al., 1986) at 30"" C under vigorous shaking (250 rpm) for 1 day. The preculture was then added 
to 500 ml YPD medium (Sherman et al., 1986) and cultivated under the same conditions. After four days cell broth was 
centrifuged (7000 rpm. 0S3 rotor, 15 min. 5*^ C) and the supernatant was collected. Induction of the GAL1 promoter 
(plasmid pYES2 from Invitrogen, San Diego, CA, USA) was done according to the manufacturers instructions. Purifica- 
tion of the muteins was as described in example 5 (s.a.). 

References: 
[0056] 

Janes, M., B. MeyhacK W Zimmermann and A. Hinnen. 1990. The influence off GAP promoter variants on hirudins 
production, avarage plasmid copy number and cell growth in Saccharomyces cerevisiae. Curr. Qenet 18: 97-103 

Hinnen. A., J.B. Hicks and Q.R. Fink. 1978. Proc. Natl. Acad. Sd. USA 75: 1^9-1933 

Sheman, J.P., Finck, G.R. and Hicks, J.B. (1986). Laboratory Course Manual tor Methods in Yeast Genetics. Cokl 
Spring Harbor University Press. 

Example 7 

Determination of ohvtase activity and su bstrate specificity 

[0057] Phytase activity was measured in an assay mixture containing 0.5% phytic acid (~5 mM). 200 mM sodium ace- 
tate. pH 5.0. After 15 min incubation at 37*^C, the reaction was stopped by addition of an equal volume of 15% trichlo- 
roacetic acid. The liberated phosphate Ions were quantified by mixing 1 00 ^1 of the assay mixture with 900 \i\ H2O and 
1 ml of 0.6 M H2SO4, 2% ascorbic acid and 0.5% ammonium motybdate. Standard solutions of potassium phosphate 
were used as reference. 

[0058] In case of pH optimum curves, purified enzymes were diluted in 10 mM sodium acetate, pH 5.0. Incubations 
were started by mixing alk^uots of the diluted protein with an equal volume of 1% phytic ackl (-10 mM) in a series of 
different buffers: 0.4 M glydne/HCt, pH 2.5; 0.4 M acetate/MaOH. pH 3.0, 3.5. 4.0. 4.5. 5.0. 5.5; 0.4 M ImMazole/HCI. 
pH 6.0, 6.5; 0.4 M Tris/HCI, pH 7.0, 7.5. 8.0, 8.5, 9.0. Control experiments showed that pH was only slightly affected by 
the mixing step. Incubations were performed for 15 min at 37''C as described above. 

[0059] For determination of the substrate spedf icrties of wild-type and mutant A fumigatus phytases, phytic add in 
the assay mixture was replaced k>y 5 mM-ooncentrations of the respective phosphate compounds. The activity tests 
were perfbmied as described above. 

[0060] Protein concentrations were calculated from the OD at 280 nm, using theoretical absorption values calculated 
from the known protein sequences with the DMA* software (DNASTAR, Inc., Madison, Wisconsin. USA). An absorption 
of 1 .0 OD at 280 nm coresponds to 0.94 rngftvA A. fumigatus phytase and 0.85 mgAnl of A. terreus cbsl 16.46 phytase. 
[0061] pH profiles of Aspergillus fumigatus mutants T1 (Q27L). T5 (Q27U 0274^ and T6 (Q27U (3274L. G277D) 
have drastically changed compared to the wild-type A. fumigatus phytase (see Figure 2). All mutants showed ec|ual pH 
profiles. Increase in specific activity at pH 5.0 of the muteins as compared to tiie wild-type phytase of Aspergillus fumi- 
gatus is shown in Table 2. Enzyme activities were measured under standard assay conditions at pH 5.0. Several indi- 
vidual measurements (n: number of assays) were averaged. 

[0062] The pH profile of A. fumigatus phytase mutant Q27A resemk)les the pH profile of A. fumigatus wild-type 

phytase over nearly the whole pH range (Rgure 20). Whereas the specific activity of wild-type phytase is decreasing at 

pH values bebw pH 4.0, the specific activity of the phytase mutant Q27A remains nearly constant down to pH 2.9. 

[0063] The single amino acid exchanges Q27U Q27I, Q27V or Q27T have remarksfaly increased the specific activity 

over the whole pH range. espedaDy between pH 5.0 and 7.5 (Figure 20). Maximum values are reached at pH 6.5. In 

addition, mutation Q27T caused the highest specific activity values Ibr phytic add at low pH (pH 3.0^.0). 

[0064] Higher specific activHies are also gained by the single mutations Q27G or Q27N, between pH 2.5 and 7.0, with 

maximum values at pH 6.0 (Figure 20). The specific activity decreases at pH values below 3.5. 

[0065] All single mutants still show a broad sut)strate specif idty which is comparable to that of A fumigatus wild-type 

phytase (Figure 21). Some of the mutants show significantly higher specific activities than other mutants for selected 
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substrates, e. g., the Q27T mutant for p-nitrophenyl phosphate and ATP, or the Q27G mutant for phosphoenolpyruvate. 
[0066] As shown in Rgure 22 the combination of mutation Q27L with S66D or S140Y and D141Q led to a shift of the 
pH profile towards lower pH. The maximum specific activity gained by the single mutation Q27L is further increased by 
the additional amino acid exchanges. 

[0067] As shown in Rgure 3. Aspergillus fumigatus phytase mutant T1 (C327L) showed no difference in substrate spe- 
cif icrty compared to the triple mutant TB (Q27L. Q274L, GI277D). 

[0068] TTie pH profiles of the mutei ns N1 -6. except N2 show significant differences compared to the wild-type phytase 
(Fig. 10). Whereas the pH prctfile of mutein N4 is expanded towards lower pH. the profiles of muteins N3 to fslG are 
shifted towards lower pH. The muteins N5, N6 reach maximum activity already at pH 3.0. 

[0069] The muteins N1 to N6 show in almost all cases a drastic reduction in specific activity for all tested substrates, 
except for phytic add (Rg. 9). Specific activity for phytic acid remained unchanged conpared to the wild-type phytase, 
whereas mutant f43 and N6 show a tendential higher activity (Fig. 19). 



Table 1 



A) Mutations towards A. terreus cbsll6.46 phytase 



Mutation 


n 


T2 


T3 


T4 


T5 


T6 


T7 


Q27L 


X 








X 


X 


X 


Q274L 




X 






X 


X 


X 


G277D 






X 






X 


X 


N340S 








X 






X 



B) Mutations towards A. niger (ficuum) phytase 



Mutation 


Nl 


N2 


N3 


N4 


N5 


N6 


G277K 


X 






X 


X 


X 


A205E 




X 




X 




X 


Y282H 






X 




X 


X 



* 



Takalea 







U/mg 


A fumigatus wild-type phytase 


26.5 ±5.2 


22 
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lyUe 2 (continued) 







UAng 


A. fumigatus Q27L 


83.4 


4 


A. fumigatus Q27L. C2274L 


88.7 ±13.5 


8 


A fumigatus Q27L. Q274L. G2770 


92.3 12.0 


9 


A. terreus cbsl 1 6.46 phytase 


195.8±17.8 


7 



Table 3 



Specific activity under standard assay conditions at pH 5.0. Average standard deviation is 

10%. 




Specific activity [UAng] 


Number of independent 
assays 


A. fumigatus wild-type phytase 


26.5 


22 


A. fumigatus Q27N 


45.5 


3 


A. fumigatus Q27T 


106.9 


3 


A. fumigatus Q27L 


83.4 


4 


A. fumigatus Q27I 


91.2 


3 


A. fumigatus Q27V 


35.0 


3 


A. fumigatus Q27A 


27.3 


3 


A. fumigatus Q27Q 


59.6 


3 


A. fumigatus Q27L-S66D 


118.5 


3 


A. fum^atus Q27L-S140Y-D141G 


193.0 


3 



35 

Example 8 

[0070] As an alternative approach to obtain phytases with modified characteristics and to get a better idea about the 
natural variation found in phytase characteristics within a certain species, naturally occurring variants of A fumigatus 

40 phytase were analysed. Phytase genes were obtained from six different isolates of A. fumigatus. The amino add 
sequence of phytase from two of the A. fumigatus isolates (ATCC 26934 and ATCC 34625) showed no difference to the 
original amino add sequence of wild-type A. fumigatus phytase ATCC 13073. Phytase from three other isolates had 
one or two amino acid substitutions, none of which directly affected the active site. Enzymatic characteristics remained 
unaffected by these substitutions (not shown). The phytase of isolate of A. fumigatus (ATCC 32239) differed in 13 posi- 

46 tions in the signal sequence and 51 positions in the mature part of the protein compared to the original wild-type A. 
fumigatus phytase (ATCC 13073). Several of these substitutions affect variable amino adds of the active site cavity. 
This resulted in an inaease in specific activity with phytic add as substrate (47 U/mg. starK^ard enzyme assay) and in 
loss of enzymatic activity above pH 7 (Fig. 24). Also in this case, the specific activity against phytic acid was increased 
relative to the specific activities with other substrates (Rg. 25). 

so 

Example 9 

[0071] Construction of plasmids pc-S130N, pc*R129L-Sl30N, pc-K167Q-R168Q encoding A. fumigatus [ATCC 
13073] phytase S130N single mutant and R129L'S130N doU)le mutant and A. niduians phytase K167G-R168Q double 
55 mutant was basically carried out as descrlsed in Example 3. Plasmid pUCI 8-Af umcDN A was used as template for site 
directed mutagenesis together with the corresponding primer sets L, M and N (Figure 14a; Figure 26). 
[0072] All mutations were verified by DNA sequence analysis of the entire gene. 
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[0073] When expressed in A. niger and stored as concentrated culture supernatants at A^'C. the phytases from A. 
fumigatus, A. n/c/i/Zans displayed tendency to undergo proteolytic degradation. N-terminal sequencing of fragments 
suggested that cleavage occured between amino acids S130-V131 and K167-R168 or R168-A169. respectively. Com- 
pared with 3D structure of A. niger phytase revealed that all cleavage sites are found within surface-exposed loop struc- 
tures and are therelbre aocessye to proteases. 

10074] Site-directed mutagenesis at protease-sensitive sites of A. fumigatus phytase (S130N. R129L-S130fSO and A. 
nidulans phytase (K167G-R168Q) yielded mutant proteins with considerably reduced susceptibility to proteolysis. 
[0075] In contrast to expression in A. niger, proteolytic degradation was not observed when the phytases were 
expressed in Hansenula polymorpha. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: F. Hoffmann-La Roche AG 

(B) STREET: Grenzacherstrasse 124 

(C) CITY: Basle 
<D) STATE: BS 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP): CH-4002 

(G) TELEPHONE: 061-6882505 
IH) TELEFAX: 061-6881395 

(I) TELEX: 962292/965542 hlr ch 

Ui) TITLE OF INVENTION: Modified Phytases 
(iii) NUMBER OF SEQUENCES: 8 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disJc 

(B) COMPUTER: IBM PC conpatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

20 (D) SOFTWARE: PatentXn Release «1.0, Version #1.30 <EPO) 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: EP 98104858.0 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : doiible 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join (158.. 204, 259.. 1600) 



(Xil SEQUENCE DESCRIPTION: SEQ ID MO: 1: 

TCTGTAACCG ATAGCGGACC GACTAGGCAT CGTTGATCCA CAATATCTCA GACAATGCAA 
60 

^ CTCAGTCGAA TATGAAGGGC TACAGCCAGC ATTTAAATAC GGCCGTCTAG GTCGGGCTCC 

120 

GGGGATGAGG AG6AGCAGGC TCGTGTTCAT TTCGGTC ATG GCT TTT TTC ACG GTC 
175 

Met Ala Phe Phe Thr Val 

45 1 5 

GCT CTT TCG CTT TAT TAC TTG CTA TCG AG 6TGAGATCTC TACAATATCT 
224 

Ala Leu Ser Leu Tyr Tyr Leu Leu Ser Arg 

10 15 

SO GTCTGCTTAG TTGAATTGGT ACTTATCTGT ACAG A GTC TCT GCT GAG GCC CCA 

277 

Val Ser Ala Gin Ala Pro 

20 



55 
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GTG 


GTC 


CAG 


325 






Val 


Val 


Gin 






25 


TTC 


ccc 


AAT 


373 






Phe 


Pro 


Asn 




40 




ATC 


GAG 


CAG 


421 






He 


Glu 


Gin 


55 






GTT 


ACC 


TTT 


469 






Val 


Thr 


Phe 


GAG 


TCG 


AAG 


517 












AAT 


GCT 


ACC 


565 






Asn 


Ala 


Thr 






105 


TAT 


ACC 


CTC 


613 






Tyr 


Thr 


Leu 



30 



35 



45 



50 



60 



65 



70 



75 



80 



85 



90 



95 



100 



110 



115 



120 



125 



130 



GTT GAT TCG GGT GCC AAG TTC TAC CGA CGG TAT AAG AAT CTC GCC AGG 
661 

Val Asp Ser Gly Ala Lys Phe Tyr Arg Arg Tyr Lys Asn Leu Ala Arg 
135 140 145 150 

AAA AAT ACT CCT TTT ATC CGT GCA TCA GGG TCT GAC CGT GTC GTT GCG 
709 

Lys Asn Thr Pro Phe He Arg Ala Ser Gly Ser Asp Arg Val Val Ala 

155 160 165 

TCT GCG GAG AAG TTC ATT AAT GGA TTT CGC AAG GCT CAG CTC CAC GAC 
757 

Ser Ala Glu Lys Phe He Asn Gly Phe Arg Lys Ala Gin Leu His Asp 
170 175 180 



CAT 


GGC 


TCC 


AAA 


CGT 


GCT 


ACG 


CCA 


GTT 


GTC 


AAT 


GTG 


ATT 


ATC 


CCT 


805 






























His 


Gly 


Ser 


Lys 


Arg 


Ala 


Thr 


Pro 


Val 


Val 


Asn Val 


He 


He 


Pro 






185 










190 










195 






ATC 


GAT 


GGG 


TTT 


AAC 


AAC 


ACC 


CTG 


GAC 


CAT 


AGC 


ACG 


TGC 


GTA 


TCT 


853 






























He 


Asp 


Gly 


Phe 


Asn 


Asn 


Thr 


Leu 


Asp 


His 


Ser 


Thr 


Cys 


Val 


Ser 




200 










205 










210 






GAG 


AAT 


GAT 


GAG 


CGG 


GCG 


GAT 


GAA 


ATT 


GAA 


Gcr 


AAT 


TTC 


ACG 


GCA 


901 






























Glu 


Asn 


Asp 


Glu 


Arg 


Ala 


Asp 


Glu 


He 


Glu 


Ala 


Asn 


Phe 


Thr 


Ala 


215 










220 










225 










ATG 


GGA 


CCT 


CCG 


ATC 


CGC 


AAA 


CGT 


CTG 


GAA 


AAT 


GAC 


CTC 


CCT 


GGC 


949 






























Met 


Gly 


Pro 


Pro 


He 


Arg 


Lys 


Arg 


Leu 


Glu 


Asn 


Asp 


Leu 


Pro 


Gly 



230 



235 



240 



245 
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AAA CTT ACA AAC GAG AAT GTA ATA TAT TTG ATG GAT ATG TGC TCT TTC 
997 

Lys Leu Thr Asn Glu Asa Val lie Tyr Leu Met Asp Met Cys Ser Phe 
250 2S5 260 

GAC ACC ATG GCG CGC ACC GCC CAC GGA ACC GAG CTG TCT CCA TTT TGT 
1045 

Asp Thr Met Ala Arg Thr Ala His Gly Thr Glu Leu Ser Pro Phe Cys 
265 270 275 

GCC ATC TTC ACT GAA AAG GAG TGG CTG CAG TAC GAC TAC CTT CAA TCT 
1093 

Ala He Phe Thr Glu Lys Glu Trp Leu Gin Tyr Asp Tyr Leu Gin Ser 
280 285 290 

CTA TCA AAG TAC TAC GGC TAC GGT GCC GGA AGC CCC CTT GGC CCA GCT 
1141 

Leu Ser Lys Tyr Tyr Gly Tyr Gly Ala Gly Ser Pro Leu Gly Pro Ala 
295 300 305 310 

CAG GGA ATT GGC TTC ACC AAC GAG CTG ATT GCC CGA CTA ACG CAA TCG 
1189 

Gin Gly He Gly Phe Thr Asn Glu Leu lie Ala Arg Leu Thr Gin Ser 

315 320 325 

CCC GTC CAG GAC AAC ACA AGC ACC AAC CAC ACT CTA GAC TCG AAC CCA 

1237 

Pro Val Gin Asp Asn Thr Ser Thr Asn His Thr Leu Asp Ser Asn Pro 
330 335 340 

GCC ACA TTT CCG CTC GAC AGG AAG CTC TAC GCC GAC TTC TCC CAC GAC 
1285 

Ala Thr Phe Pro Leu Asp Arg Lys Leu Tyr Ala Asp Phe Ser His Asp 
345 350 355 

AAT AGC ATG ATA TCG ATA TTC TTC GCC ATG GGT CTG TAC AAC GGC ACC 
1333 

Asn Ser Met He Ser He Phe Phe Ala Met Gly Leu Tyr Asn Gly Thr 
360 365 370 

CAG CCG CTG TCA ATG GAT TCC GTG GAG TCG ATC CAG GAG ATG GAC GGT 

1381 

Gin Pro Leu Ser Met Asp Ser Val Glu Ser He Gin Glu Met Asp Gly 
375 380 385 390 

TAC GCG GCG TCT TGG ACT GTT CCG TTT GGT GCG AGG GCT TAC TTT GAG 
1429 

Tyr Ala Ala Ser Trp Thr Val Pro Phe Gly Ala Arg Ala Tyr Phe Glu 

395 400 405 

CTC ATG CAG TGC GAG AAG AAG GAG CCG CTT GTG CGG GTA TTA GTG AAT 
1477 

Leu Met Gin Cys Glu Lys Lys Glu Pro Leu Val Arg Val Leu Val Asn 
410 415 420 

GAT CGC GTT GTT COT CTT CAT GGC TGC GCA GTT GAC AAG TTT GGA CGG 
1525 

Asp Arg Val Val Pro Leu His Gly Cys Ala Val Asp Lys Phe Gly Arg 
425 430 435 

TGC >CT TTG GAC GAT TGG GTA GAG GGC TTG AAT TTT GCA AGG AGC GGC 
1573 

Cys Thr Leu Asp Asp Trp Val Glu Gly Leu Asn Phe Ala Arg Ser Gly 
440 445 450 

GGG AAC TGG AAG ACT TGT TTT ACC CTA TAAAGGGCGT TTGCTCATTC 

1620 

Gly Asn Trp Lys Thr Cys Phe Thr Leu 
455 460 
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ATAAGTGTTG TGCAGGTATA 
1680 

GACCAAGAAT GATTTGTTTG 
1740 

ACCTATCCTC CATGTGTAGG 
1800 

TTTCACCAAA CATATCCTCG 
1860 

TCTATCTATA TCGTCAACAA 
1920 

AAATTTCTTC G 
1931 



GGAAGGTTAG GGAATTAGCT 
TTCTCAAGGC CTTCTAGCAT 
TGAACCCGCT CTTGCATCAA 
TGTCCTCTCT TCTGCTCTTC 
AACTACCACC CAAACACCAA 



GTTTGGCTTT ACTCTTATTA 
ATCGTCAAGT GGGATAAATC 
CCTCTT6TGT TTCAGAGTA6 
GGTCTCATAT TACACTGTTC 
ATGTCACACT TTCCAGCACG 



(2) INFORMATION FOR SEQ ID NO: 2: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 2: 



Met Ala Phe Phe Thr Val Ala Leu Ser Leu Tyr Tyr Leu Leu Ser Arg 
15 10 15 

Val Ser Ala Gin Ala Pro Val Val Gin Asn His Ser Cys Asn Thr Ala 

20 25 30 

Asp Gly Gly Tyr Gin Cys Phe Pro Asn Val Ser His Val Trp Gly Gin 
35 40 45 

Tyr Ser Pro Tyr Phe Ser lie Glu Gin Glu Ser Ala He Ser Glu Asp 
50 55 60 

Val Pro Hie Gly Cys Glu Val Thr Phe Val Gin Val Leu Ser Arg His 
65 70 75 80 

Gly Ala Arg Tyr Pro Thr Glu Ser Lys Ser Lys Ala Tyr Ser Gly Leu 

85 90 95 

He Glu Ala He Gin Lys Asn Ala Thr Ser Phe Trp Gly Gin Tyr Ala 
100 105 110 

Phe Leu Glu Ser Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu Thr He 
115 120 125 

Phe Gly Glu Asn Gin Met Val Asp Ser Gly Ala Lys Phe Tyr Arg Arg 
130 135 140 

Tyr Lys Asn Leu Ala Arg Lys Asn Thr Pro Phe He Arg Ala Ser Gly 
145 150 155 160 

Ser Asp Arg Val Val Ala Ser Ala Glu Lys Phe He Asn Gly Phe Arg 

165 170 175 

Lys Ala Gin Leu His Asp His Gly Ser Lys Arg Ala Thr Pro Val Val 
180 185 190 

Asn Val He He Pro Glu He Asp Gly Phe Asn Asn Thr Leu Asp His 
195 200 205 



Ser Thr Cys Val Ser Phe Glu Asn Asp Glu Arg Ala Asp Glu He Glu 
210 215 220 
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Ala Asn Phe Thr Ala lie Met Gly Pro Pro He Arg Lys Arg Leu Glu 
225 230 235 240 

Asn Asp Leu Pro Gly He Lys Leu Thr Asn Glu Asn Val He Tyr Leu 

245 250 255 

Met Asp Met Cys Ser Phe Asp Thr Met Ala Arg Thr Ala His Gly Thr 
260 265 270 

Glu Leu Ser Pro Phe Cys Ala He Phe Thr Glu Lys Glu Trp Leu Gin 
275 280 285 

Tyr Asp Tyr Leu Gin Ser Leu Ser Lys Tyr Tyr Gly Tyr Gly Ala Gly 
290 295 300 

Ser Pro Leu Gly Pro Ala Gin Gly He Gly Phe Thr Asn Glu Leu lie 
305 310 315 320 

Ala Arg Leu Thr Gin Ser Pro Val Gin Asp Asn Thr Ser Thr Asn His 

325 330 335 

Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asp Arg Lys Leu Tyr 
340 345 350 

Ala Asp Phe Ser His Asp Asn Ser Met He Ser He Phe Phe Ala Met 
355 360 365 

Gly Leu Tyr Asn Gly Thr Gin Pro Leu Ser Met Asp Ser Val Glu Ser 
370 375 380 

He Gin Glu Met Asp Gly Tyr Ala Ala Ser Trp Thr Val Pro Phe Gly 
385 390 395 400 

Ala Arg Ala Tyr Phe Glu Leu Met Gin Cys Glu Lys Lys Glu Pro Leu 

405 410 415 

Val Arg Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys Ala 
420 425 430 

Val Asp Lys Phe Gly Arg Cys Thr Leu Asp Asp Trp Val Glu Gly Leu 
435 440 445 

Asn Phe Ala Arg Ser Gly Gly Asn Trp Lys Thr Cys Phe Thr Leu 
450 455 460 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: DNA ( genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join (288. .334, 390.. 1740) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TTCCACGCTG AAAGCCTGAC TGCGATTTCC AAGCTGCATG CAGGCTGCTC AACTGCCTGC 
60 

TTATCTTCAT CAGACGCAGA TACACAACCT GGTCTGTAGA TGCACCCATG ACGGACGAAC 
120 
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GCACCGCTCT CTTGGCCTCC AGGGACCCC5G AGGTCGftGGG CGATGAGGTC GCGCCCTCGA 
180 

CGGCCTCCCA GTCCCTGTTG CAGTTGAGAT CTCGCTGCGA ACGTCGACCG CAGATATGGT 
240 

TGTCTTCGAC GTTTTCTCGC CTTCGAGGAA GAATTGCTGC TGTGACG ATG AGT CTG 
296 

Met Ser Leu 
1 

TTG TTG CTG GTG CTG TCC GGC GGG TTG GTC GCG TTA TA GTATGCTCCT 
344 

Leu Leu Leu Val Leu Ser Gly Gly Leu Val Ala Leu Tyr 
5 10 15 

TCTCTCTGGT CATATTGTTT TCTGCTAACG TTCTCATAAT TGAAG T GTC TCA AGA 
399 

Val Ser Arg 

AAT CCG CAT GTT GAT AGC CAC TCT TGC AAT ACA GTG GAA GGA GGG TAT 
447 

Asn Pro His Val Asp Ser His Ser Cys Asn Thr Val Glu Gly Gly Tyr 
20 25 3D 35 

CAG TGT CGT CCA GAA ATC TCC CAC TCC TGG GGC CAG TAT TCT CCA TTC 
495 

Gin Cys Arg Pro Glu He Ser His Ser Trp Gly Gin Tyr Ser Pro Phe 

40 45 50 

TTC TCC CTG GCA GAC CAG TCG GAG ATC TCG CCA GAT GTC CCA CAG AAC 
543 

Phe Ser Leu Ala Asp Gin Ser Glu He Ser Pro Asp Val Pro Gin Asn 

55 60 65 

TGC AAG ATT ACG TTT GTC CAG CTG CTT TCT CGT CAC GGC GCT AGA TAC 
591 

Cys Lys lie Thr Phe Val Gin Leu Leu Ser Arg His Gly Ala Arg Tyr 
70 75 80 

CCT ACG TCT TCC AAG ACG GAG CTG TAT TCG CAG CTG ATC AGT CGG ATT 
639 

Pro Thr Ser Ser Lys Thr Glu Leu Tyr Ser Gin Leu He Ser Arg He 
85 90 95 

CAG AAG ACG GCG ACT GCG TAC AAA GGC TAC TAT GCC TTC TTG AAA GAC 
687 

Gin Lys Thr Ala Thr Ala Tyr Lys Gly Tyr Tyr Ala Phe Leu Lys Asp 
100 105 110 H5 

TAC AGA TAC CAG CTG GGA GCG AAC GAC CTG ACG CCC TTT GGG GAA AAC 
735 

Tyr Arg TVr Gin Leu Gly Ala Asn Asp Leu Thr Pro Phe Gly Glu Asn 

120 125 130 

CAG ATG ATC CAG TTG GGC ATC AAG TTT TAT AAC CAT TAC AAG AGT CTC 
783 

Gin Met He Gin Leu Gly He Lys Phe Tyr Asn His Tyr Lys Ser Leu 
135 140 145 

GCC AGG AAT GCC GTC CCA TTC GTT CGT TGC TCC GGC TCT GAT CGG GTC 
831 

Ala Arg Asn Ala Val Pro Phe Val Arg Cys Ser Gly Ser Asp Arg Val 
150 155 160 

ATT GCC TCG GGG AGA CTT TTC ATC GAA GGT TTC CAG AGC GCC AAA GTG 
879 

He Ala Ser Gly Arg Leu Phe He Glu Gly Phe Gin Ser Ala Lys Val 
165 170 175 
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CTG GAT CCT CAT TCA GAC AAG CAT GAC GCT CCT CCC ACG ATC AAC GTG 
927 

Leu Asp Pro His Ser Asp Lys His Asp Ala Pro Pro Thr He Asn Val 
180 185 190 195 

ATC ATC GAG GAG GGT CCG TCC TAG AAT AAC ACG CTC GAC ACC GGC AGC 
975 

He He Glu Glu Gly Pro Ser Tyr Asn Asn Thr Leu Asp Thr Gly Ser 

200 205 210 

TGT CCA GTC TTT GAG GAC AGC AGC GGG GGA CAT GAC GCA CAG GAA AAG 
1023 

Cys Pro Val Phe Glu Asp Ser Ser Gly Gly His Asp Ala Gin Glu Lys 
215 220 225 

TTC GCA AAG CAA TTC GCA CCA GCT ATC CTG GAA AAG ATC AAG GAC CAT 
1071 

Phe Ala Lys Gin Phe Ala Pro Ala He Leu Glu Lys He Lys Asp His 
230 235 240 

CTT CCC GGC GTG GAC CTG GCC GTG TCG GAT GTA CCG TAC TTG ATG GAC 
1119 

Leu Pro Gly Val Asp Leu Ala Val Ser Asp Val Pro Tyr Leu Met Asp 
245 250 255 

TTG TGT CCG TTT GAG ACC TTG GCT CGC AAC CAC ACA GAC ACG CTG TCT 
1167 

Leu Cys Pro Phe Glu Thr Leu Ala Arg Asn His Thr Asp Thr Leu Ser 
260 265 270 275 

CCG TTC TGC GCT CTT TCC ACG CAA GAG GAG TGG CAA GCA TAT GAC TAC 
1215 

Pro Phe Cys Ala Leu Ser Thr Gin Glu Glu Trp Gin Ala Tyr Asp Tyr 

280 285 290 

TAC CAA AGT CTG GGG AAA TAC TAT GGC AAT GGC GGG GGT AAC CCG TTG 
1263 

Tyr Gin Ser Leu Gly Lys Tyr Tyr Gly Asn Gly Gly Gly Asn Pro Leu 
295 300 305 

GGG CCA GCC CAA GGC GTG GGG TTT GTC AAC GAG TTG ATT GCT CGC ATG 
1311 

Gly Pro Ala Gin Gly Val Gly Phe Val Asn Glu Leu He Ala Arg Met 
310 315 320 

ACC CAT AGC CCT GTC CAG GAC TAC ACC ACG GTC AAC CAC ACT CTT GAC 
1359 

Thr His Ser Pro Val Gin Asp Tyr Thr Thr Val Asn His Thr Leu Asp 
325 330 335 

TCG AAT CCG GCG ACA TTC CCT TTG AAC GCG ACG CTG TAC GCA GAT TTC 
1407 

Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr Ala Asp Phe 
340 345 350 355 

AGC CAC GAC AAC ACA ATG ACG TCA ATT TTC GCG GCC TTG GGC CTG TAC 
1455 

Ser His Asp Asn Thr Met Thr Ser He Phe Ala Ala Leu Gly Leu Tyr 

360 365 370 

AAC GGG ACC GCG AAG CTG TCC ACG ACC GAG ATC AAG TCC ATT GAA GAG 
1503 

Asn Gly Thr Ala Lys Leu Ser Thr Thr Glu He Lys Ser He Glu Glu 
375 380 385 

ACG GAC GGC TAC TCG GCG GCG TGG ACC GTT CCG TTC GGG GGG CGA GCC 
1551 

Thr Asp Gly Tyr Ser Ala Ala Trp Thr Val Pro Phe Gly Gly Arg Ala 
390 395 400 . 
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TAT ATC GAG ATG ATG CAG TGT GAT GAT TCG GAT GAG CCA GTC GTT CGG 
1599 

Tyr He Glu Met Met Glxi Cys Asp Asp Ser Asp Glu Pro val Val Arg 

405 410 415 

GTG CTG GTC AAC GAC CGG GTG GTG CCA CTG CAT GGC TGC GAG GTG GAC 
1647 

Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys Glu Val Asp 
420 425 430 435 

TCC CTG GGG CGA TGC AAA CGA GAC GAC TTT GTC AGG GGA CTG AGT TTT 
1695 

Ser Leu Gly Arg Cys Lys Arg Asp Asp Phe Val Arg Gly Leu Ser Phe 

440 445 450 

GCG CGA CAG GGT GGG AAC TGG GAG GGG TGT TAC GCT GCT TCT GAG 
1740 

Ala Arg Gin Gly Gly Asn Trp Glu Gly Cys Tyr Ala Ala Ser Glu 
455 460 465 

TAGGTTTATT CAGCGAGTTT CGACCTTTCT ATCCTTCAAA CACTGCACAA AGACACACTG 
1800 

CATGAAATGG TAACAGGCCT GGAGCGTTTT AGAA6GAAAA AAGTT 
SO 1845 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 466 amino acids 
2s (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ser Leu Leu Leu Leu Val Leu Ser Gly Gly Leu Val Ala Leu Tyr 
30 1 5 10 15 

Val Ser Arg Asn Pro His Val Asp Ser His Ser Cys Asn Thr Val Glu 

20 25 30 



35 



Gly Gly Tyr Gin Cys Arg Pro Glu He Ser His Ser Trp GXy Gin Tyr 
35 40 45 

Ser Pro Phe Phe Ser Leu Ala Asp Gin Ser Glu He Ser Pro Asp Val 
50 55 60 

Pro Gin Asn Cys Lys He Thr Phe Val Gin Leu Leu Ser Arg His Gly 
65 70 75 80 

40 Ala Arg Tyr Pro Thr Ser Ser Lys Thr Glu Leu Tyr Ser Gin Leu He 

85 90 95 

Ser Arg He Gin Lys Thr Ala Thr Ala Tyr Lys Gly Tyr Tyr Ala Phe 
100 105 110 

Leu Lys Asp Tyr Arg Tyr Gin Leu Gly Ala Asn Asp Leu Thr Pro Phe 
45 115 120 125 

Gly Glu Asn Gin Met He Gin Leu Gly He Lys Phe Tyr Asn His Tyr 
130 135 140 



SO 



Lys Ser Leu Ala Arg Asn Ala Val Pro Phe val Arg Cys Ser Gly Ser 
145 150 155 160 

Asp Arg Val He Ala Ser Gly Arg Leu Phe He Glu Gly Phe Gin Ser 

165 170 175 
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Ala Lys Val Leu Asp Pro His Ser Asp Lys His Asp Ala Pro Pro Thr 
160 185 190 

lie Asn Val lie lie Glu Glu Gly Pro Ser Tyr Asn Asn Thr Leu Asp 
195 200 205 

Thr Gly Ser Cys Pro Val Phe Glu Asp Ser Ser Gly Gly His Asp Ala 
210 215 220 

Gin Glu Lys Phe Ala Lys Gin Phe Ala Pro Ala He Leu Glu Lys lie 
225 230 235 240 

Lys Asp His Leu Pro Gly Val Asp Leu Ala Val Ser Asp Val Pro Tyr 

245 250 255 

Leu Met Asp Leu Cys Pro Phe Glu Thr Leu Ala Arg Asn His Thr Asp 
260 265 270 

Thr Leu Ser Pro Phe Cys Ala Leu Ser Thr Gin Glu Glu Trp Gin Ala 
275 280 285 

Tyr Asp Tyr Tyr Gin Ser Leu Gly Lys Tyr Tyr Gly Asn Gly Gly Gly 
290 295 300 

Asn Pro Leu Gly Pro Ala Gin Gly Val Gly Phe Val Asn Glu Leu lie 
305 310 315 320 

Ala Arg Met Thr His Ser Pro Val Gin Asp Tyr Thr Thr Val Asn His 

325 330 335 

Thr Leu Asp Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr Leu Tyr 
25 340 345 350 

Ala Asp Phe Ser His Asp Asn Thr Met Thr Ser He Phe Ala Ala Leu 
355 360 365 



IS 



SO 



30 



Gly Leu Tyr Asn Gly Thr Ala Lys Leu Ser Thr Thr Glu He Lys Ser 
370 375 380 

He Glu Glu Thr Asp Gly Tyr Ser Ala Ala Trp Thr Val Pro Phe Gly 
385 390 395 400 

Gly Arg Ala Tyr He Glu Met Met Gin Cys Asp Asp Ser Asp Glu Pro 

405 410 415 

3S Val Val Arg Val Leu Val Asn Asp Arg Val Val Pro Leu His Gly Cys 

420 425 430 

Glu Val Asp Ser Leu Gly Arg Cys Lys Arg Asp Asp Phe Val Arg Gly 
435 440 445 

Leu Ser Phe Ala Arg Gin Gly Gly Asn Trp Glu Gly Cys Tyr Ala Ala 
^ 450 455 460 

Ser Glu 
465 

(2) INFORMATION FOR SEQ ID NO: 5: 

^ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 
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(A) NAME/ KEY: CDS 

(B) LCX:ATION:join(43..89, 147.. 1494) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AGATTCAACG ACGGAGGAAT CGCAACCCTA ATTGTCGGTA TC ATG GTG ACT CTG 
54 

Met Val Thr Leu 
1 

ACT TTC CTG CTT TCG GCG GCG TAT CTG CTT TCT GG GTGAGTGGCT 
99 

Thr Phe Leu Leu Ser Ala Ala Tyr Leu Leu Ser Gly 
5 10 15 

TGGATCTATT GCTCGGATAG GGCTGTGGTG CTGATTCTGA AACGGAG T AGA GTG 
153 

Arg Val 



TCT 


GCG 


GCA 


201 






Ser 


Ala 


Ala 




20 




CTC 


GGG 


TAC 


249 






Leu 


Gly 


Tyr 


35 






TCG 


CCA 


TTC 


297 






Ser 


Pro 


Phe 


CCC 


AAG 


GAT 


345 






Pro 


Lys 


Asp 


GCG 


CGG 


TAC 


393 






Ala 


Arg 


Tyr 






85 


AC6 


GCG 


ATC 


441 






Thr 


Ala 


He 



25 30 



40 45 50 



55 60 65 



70 75 80 



90 95 



100 105 110 

TTG AAG ACQ TAC AAC TAT ACT CTG GGT GCG GAT GAC CTC ACT CCC TTT 
489 

Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu Thr Pro Phe 
115 120 125 130 



GGG 

537 
Gly 


GAG 

Glu 


CAG 

Gin 


CAG 

Gin 


CTG 

Leu 
135 


GTG 

val 


AAC 

Asn 


TCG 

Ser 


GGC 
Gly 


ATC 

He 
140 


AAG 

Lys 


TTC 

Phe 


TAC CAG 

Tyr Gin 


AGK3 

Arg 
145 


AAG 
585 
Lys 


GCT 


CTG 


GCG 


CGC 


AGT 


GTG 


OTG 


CCG 


TTT 


ATT 


CGC 


GCC TCA 


GGC 


Ala 


Leu 


Ala 

150 


Arg 


Ser 


Val 


Val 


Pro 
155 


Phe 


He 


Arg 


Ala Ser 
160 


Gly 


GAC 

633 
Asp 


CGG 
Arg 


GTT 

Val 
165 


ATT 
He 


GCT 
Ala 


TCG 
Ser 


GGA 
Gly 


GAG 

Glu 
170 


AAG 

Lys 


TTC 
Phe 


ATC GAG GGG TTC 

He Glu Gly Phe 
175 


CAG 
Gin 
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10 



GCG AAG CTG GCT GAT CCT GGC GCG ACG AAC CGC GCC GCT CCG GCG ATT 
681 

Ala Lys Leu Ala Asp Pro Gly Ala Thr Asn Arg Ala Ala Pro Ala lie 
180 185 190 

ACT GTG ATT ATT CCG GAG AGO GAG ACG TTC AAC AAT ACG CTG GAC CAC 
729 

Ser Val He He Pro 61u Ser Glu Thr Phe Asn Asn Thr Leu. Asp His 
195 200 205 210 

GGT GTG TGC ACG AAG TTT GAG GCG ACT CAG CTG GGA GAT GAG GTT GCG 
777 

Gly Val Cys Thr Lys Phe Glu Ala Ser Gin Leu Gly Asp Glu Val Ala 

215 220 225 

GCC AAT TTC ACT GCG CTC TTT GCA CCC GAC ATC CGA GCT CGC GCC GAG 
825 

15 Ala Asn Phe Thr Ala Leu Phe Ala Pro Asp He Arg Ala Arg Ala Glu 

230 235 240 

AAG CAT CTT CCT GGC GTG ACG CTG ACA GAC GAG GAC GTT GTC AGT CTA 
873 

Lys His Leu Pro Gly Val Thr Leu Thr Asp Glu Asp Val Val Ser Leu 
245 250 255 

20 

ATG GAC ATG TGT TC6 TTT GAT ACG GTA GCG CGC ACC AGC GAC GCA AGT 
921 

Met Asp Met Cys Ser Phe Asp Thr Val Ala Arg Thr Ser Asp Ala Ser 
260 265 270 

CAG CTG TCA CCG TTC TGT CAA CTC TTC ACT CAC AAT GAG TGG AAG AAG 
2S 969 

Gin Leu Ser Pro Phe Cys Gin Leu Phe Thr His Asn Glu Trp Lys Lys 

275 280 285 290 

TAC AAC TAG CTT CAG TCC TTG GGC AAG TAG TAC GGC TAC GGC GCA GGC 

1017 

Tyr Asn Tyr Leu Gin Ser Leu Gly Lys Tyr Tyr Gly Tyr Gly Ala Gly 
30 295 300 305 

AAC CCT CTG GGA CCG GCT CAG GGG ATA GGG TTC ACC AAC GAG CTG ATT 

1065 

Asn Pro Leu Gly Pro Ala Gin Gly He Gly Phe Thr Asn Glu Leu He 
310 315 320 

^ GCC CGG TTG ACT CGT TOG CCA GTG CAG GAC CAC ACC AGC ACT AAC TCG 

1113 

Ala Arg Leu Thr Arg Ser Pro Val Gin Asp His Thr Ser Thr Asn Ser 
325 330 335 

ACT CTA GTC TCC AAC CCG GCC ACC TTC CCG TTG AAC GCT ACC ATG TAC 

40 11^1 

Thr Leu Val Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr Met Tyr 

340 345 350 

GTC GAC TTT TCA CAC GAC AAC AGC ATG GTT TCC ATC TTC TTT GCA TTG 

1209 

Val Asp Phe Ser His Asp Asn Ser Met Val Ser He Phe Phe Ala Leu 

45 355 360 365 370 

GGC CTG TAC AAC GGC ACT GAA CCC TTG TCC CGG ACC TCG GTG GAA AGC 
1257 

Gly Leu Tyr Asn Gly Thr Glu Pro Leu Ser Arg Thr Ser Val Glu Ser 

375 380 385 

SO GCC AAG GAA TTG GAT GGG TAT TCT GCA TCC TGG GTG GTG CCT TTC GGC 

1305 

Ala Lys Glu Leu Asp Gly Tyr Ser Ala Ser Trp Val Val, Pro Phe Gly 
390 395 400 
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GCG CGA GCC TAC TTC GAG ACG ATG CAA TGC AAG TCG GAA AAG GAG CCT 

1353 

Ala Arg Ala Tyr Phe Glu Thr Met Gin Cys Lys Ser Glu Lys Glu Pro 
405 410 415 

CTT GTT CGC GCT TTG ATT AAT GAG CGG GTT GTG CCA CTG CAT GGC TGC 

1401 

Leu Val Arg Ala Leu lie Asn Asp Arg Val Val Pro Leu His Gly Cys 
420 425 430 

GAT GTG GAC AAG CTG GGG CGA TGC AAG CTG AAT GAC TTT GTC AAG GGA 
1449 

Asp Val Asp Lys Leu Gly Arg Cys Lys Leu Asn Asp Phe Val Lys Gly 
435 44C 445 450 

TTG ACT TGG GCC AGA TCT GGG GGC AAC TGG GGA GAG TGC TTT AGT 
1494 

Leu Ser Trp Ala Arg Ser Gly Gly Asn Trp Gly Glu Cys Phe Ser 

455 460 465 

TGAGATGTCA TTGTTATGCT ATACTCCAAT AGACCGTTGC TTAGCCATTC ACTTCACTTT 
1554 

GCTCGAACCG CCTGCCG 
1571 

(2) INFORMATION FOR SEQ 10 NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{11} MOLECULE TYPE: procein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Val Thr Leu Thr Phe Leu Leu Ser Ala Ala Tyr Leu Leu Ser Gly 
15 10 15 

Arg Val Ser Ala Ala Pro Ser Ser Ala Gly Ser Lys Ser Cys Asp Thr 

20 25 30 

Val Asp Leu Gly Tyr Gin Cys Ser Pro Ala Thr Ser His Leu Trp Gly 
35 40 45 

Gin Tyr Ser Pro Phe Phe Ser Leu Glu Asp Glu Leu Ser Val Ser Ser 
50 55 60 

Lys Leu Fro Lys Asp Cys Arg lie Thr Leu Val Gin Val Leu Ser Arg 
65 70 75 80 

His Gly Ala Arg Tyr Pro Thr Ser Ser Lys Ser Lys Lys Tyr Lys Lys 

85 90 95 

Leu Val Thr Ala lie Gin Ala Asn Ala Thr Asp Phe Lys Gly Lys Phe 
100 105 110 

Ala Phe Leu Lys Thr Tyr Asn Tyr Thr Leu Gly Ala Asp Asp Leu Thr 
lis 120 125 

Pro Phe Gly Glu Gin Gin Leu Val Asn Ser Gly lie Lys Phe Tyr Gin 
130 135 140 

Arg Tyr Lys Ala Leu Ala Arg Ser Val Val Pro Phe lie Arg Ala Ser 
145 150 155 160 

Gly Ser Asp Arg Val lie Ala Ser Gly Glu Lys Phe lie Glu Gly Phe 

165 170 175 
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10 



Gin Gin Ala Lys Leu Ala Asp Pro Gly Ala Thr Asn Arg Ala Ala Pro 
180 185 190 

Ala lie Ser Val lie lie Pro Glu Ser Glu Thr Phe Asn Asn Thr Leu 
195 200 205 

Asp His Gly Val Cys Thr Lys Phe Glu Ala Ser Gin Leu Gly Asp Glu 
210 215 220 

Val Ala Ala Asn Phe Thr Ala Leu Phe Ala Pro Asp lie Arg Ala Arg 
225 230 235 240 

Ala Glu Lys His Leu Pro Gly Val Thr Leu Thr Asp Glu Asp Val Val 

245 250 255 

Ser Leu Met Asp Met Cys Ser Phe Asp Thr Val Ala Arg Thr Ser Asp 
260 265 270 

Ala Ser Gin Leu Ser Pro Phe Cys Gin Leu Phe Thr His Asn Glu Trp 
275 280 285 

Lys Lys Tyr Asn Tyr Leu Gin Ser Leu Gly Lys Tyr Tyr Gly Tyr Gly 
290 295 300 

an Ala Gly Asn Pro Leu Gly Pro Ala Gin Gly lie Gly Phe Thr Asn Glu 

^ 305 310 315 320 

Leu He Ala Arg Leu Thr Arg Ser Pro Val Gin Asp His Thr Ser Thr 

325 330 335 



75 



25 



30 



Asn Ser Thr Leu Val Ser Asn Pro Ala Thr Phe Pro Leu Asn Ala Thr 
340 345 350 

Met Tyr Val Asp Phe Ser His Asp Asn Ser Met Val Ser He Phe Phe 
355 360 365 

Ala Leu Gly Leu Tyr Asn Gly Thr Glu Pro Leu Ser Arg Thr Ser Val 
370 375 380 

Glu Ser Ala Lys Glu Leu Asp Gly Tyr Ser Ala Ser Trp Val Val Pro 
385 390 395 400 

Phe Gly Ala Arg Ala Tyr Phe Glu Thr Met Gin Cys Lys Ser Glu Lys 

405 410 415 

35 Glu Pro Leu Val Arg Ala Leu He Asn Asp Arg Val Val Pro Leu His 

420 425 430 

Gly Cys Asp Val Asp Lys Leu Gly Arg Cys Lys Leu Asn Asp Phe Val 
435 440 445 

Lys Gly Leu Ser Trp Ala Arg Ser Gly Gly Asn Trp Gly Glu Cys Phe 
40 450 455 460 

Ser 
465 

(2) INFORMATION FOR SEQ ID NO: 7: 

4S (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



SO 



(ii) MOLECULE TYPE: DNA (genomic) 
( ix) FEATURE : 
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(A) NAME/KEY: CDS 

(B) LOCATION: join(78. .124, 177.. 1527) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACGTCCCAGG TCGGGGACTA CATCCGCTAT GTGGTCCTCT ACTTCGTCGG AAGAATATAC 
60 

TGTCTCTTGT GGCTACC ATG OGG GTT TTC GTC GTT CTA TTA TCT ATC GCG 
110 

Met Gly Val Phe Val Val Leu Leu Ser He Ala 
15 10 

ACT CTG TTC GGC AG GTATGTGCAC CGCTCTAGGT TCAACTCGCC TGGTAACTGA 
164 



Thr 


Leu 


Phe Gly 
15 


Ser 






















CAAACAGCAC AG C 
213 


ACA 

Thr 


TCG GGC 
Ser Gly 


ACT GCG 

Thr Ala 
20 


CTG 
Leu 


GGC 

Gly 


CCC CGT 

Pro Arg 
25 


GGA AAT 

Gly Asn 


CAC 
His 


AGC 
261 
Ser 


GAC 

Asp 
30 


TGC 
Cys 


ACC 
Thr 


TCA 
Ser 


GTC GAC 

Val Asp 

35 


CGG GGG 
Arg Gly 


TAT 
Tyr 


CAA 
Gin 


TGC 

Cys 
40 


TTC 

Phe 


CCT 
Pro 


GAG 
Glu 


CTC 
Leu 


TCC 
309 
Ser 
45 


CAT 

His 


AAA 

Lys 


TGG 
Trp 


GGT 
Gly 


CTC 

Leu 
50 


TAC 

Tyr 


GCG CCC 
Ala Pro 


TAT 
Tyr 


TTC 

Phe 
55 


TCC CTC 
Ser Leu 


CAG GAT 
Gin Asp 


GAA 

Glu 
60 


TCT 
357 
Ser 


CCG 
Pro 


TTT 
Phe 


CCT 
Pro 


CTG 

Leu 
65 


GAC GTC 
Asp Val 


CCG GAT 
Pro Asp 


GAC 

Asp 
70 


TGC 
Cys 


CAC 
His 


ATC 
lie 


ACC 

Thr 


TTT 

Phe 
75 


GTG 
Val 


CAG 
405 
Gin 


GTG 
Val 


CTG 
Leu 


GCC 

Ala 
80 


CGA 
Arg 


CAT 
His 


GGA 
Gly 


GCG CGG 

Ala Arg 
85 


TCT 
Ser 


CCA 
Pro 


ACC 

Thr 


GAT 
Asp 


AGC 

Ser 
90 


AAG 
Lys 


ACA 
Thr 


AAG 
453 
Lys 


GCG 
Ala 


TAT 

Tyr 
95 


GCC 
Ala 


GCG 
Ala 


ACT ATT 
Thr He 


GCA GCC 

Ala Ala 
100 


ATC 

lie 


CAG 
Gin 


AAG 
Lys 


AAT 

Asn 
105 


GCC 
Ala 


ACC 
Thr 


GCG 
Ala 


TTG 
501 
Leu 


CCG 

Pro 
110 


GGC 
Gly 


AAA 

Lys 


TAC 

lyr 


GCC 
Ala 


TTC CTG AAG 

Phe Leu Lys 
115 


TCG 

Ser 


TAC 
Tyr 


AAT 

Asn 
120 


TAC 
Tyr 


TCC 
Ser 


ATG 
Met 


GGC 
Gly 


TCC 
549 
Ser 
125 


GAG 
Glu 


AAC 
Asn 


CTG 
Leu 


AAC 
Asn 


CCC 

Pro 
130 


TTC 
Phe 


GGG CGG 
Gly Arg 


AAC 

Asn 


CAA 

Gin 
135 


CTG 

Leu 


CAA 

Gin 


GAT 

Asp 


CTG 
Leu 


GGC 

Gly 
140 


GCC 
597 
Ala 


CAG 
Gin 


TTC 
Phe 


TAG 
Tyr 


CGT 

Arg 
145 


CGC 
Arg 


TAC 
Tyr 


GAC ACC 
Asp Thr 


CTC 

Leu 
150 


ACC 

Thr 


CGG 
Arg 


CAC 

His 


ATC 

He 


AAC 

Asn 
155 


CCT 
Pro 


TTC 
645 
Phe 


GTC 
Val 


CGG 
hrg 


GCC 

Ala 
160 


OCG 
Ala 

• 


GAT 
Asp 


TCC 
Ser 


TCC CGC 

Ser Arg 
165 


GTC 
Val 


CAC 

His 


GAA 

Glu 


TCA 
Ser 


GCC 

Ala 
170 


GAG 

Glu 


AAG 

Lys 
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TTC 


GTC 


GAG 


GGC 


TTC 


CAA 


AAC 


GCC 


CGC 


CAA 


GGC 


GAT 


CCT 


CAC 


GCC 


AAC 




693 


























His 


Ala 






Phe 


Val 


Glu 


Gly 


Phe 


Gin 


Asn 


Ala 


ATg 


ULn 


\»xy 




rro 


As 11 








175 








1 OA 
loU 


















5 


CCT 


CAC 


CAG 


CCG 


TCG 


CCG 


CGC 


GTG 


GAT 


GTA 


GTC 


ATC 


CCC 


GAA GGC 


ACC 




741 


























Glu 


Gly 






Pro 


His 


Gin 


Pro 


Ser 


Pro 




vax 


ASp 


vai 












190 










195 










T Art 












GCC 


TAC 


AAC 


AAC 


ACG 


CTC 


GAG 


CAC 


AGC 


ATC 


TGC 


ACC 


GCC 


TTC 


GAG 


GCC 


10 


789 




























Glu 


AX a 




Ala 


Tyr 


Asn 


Asn 


Thr 


Leu 


Glu 


His 


Ser 


lie 


Cys 


Tnr 


Hi a 


Phe 




205 








210 










215 














AGC 


ACC 


GTC 


GGC 


GAC 


GCC 


GCG 


GCA 


GAC 


AAC 


TTC 


ACT 


GCC 


GTG 


TTC 


GCG 




837 






























Al m 


IS 


Ser 


Thr 


Val 


Gly 


Asp 


Ala 


Ala 


Axa 


Asp 


Asn 


irne 






Val 


Phe 








225 




















235 






CCG 


CCG 


ATC 


GCC 


AAG 


CGT 


CTG 


GAG 


GCC 


GAT 


CTG 


ccc 


GGC 


GTG 


CAG 


CTG 




885 
























Giy 


Val 


Gin 






Pro 


Ala 


He 


Ala 


Lys 


m ^ 

AT? 


T Asa 

Leu 


Glu 


Ala 


Asp 


Lieu 


D V* A 

rro 










240 








245 










250 






20 




GCC 


GAC 


GAC 


GTG 


GTC 


AAT 


CTG 


ATG 


GCC 


ATG 


TGT 


CCG 


TTC 


GAG 


ACG 




933 




























Glu 


TAT 




Ser 


Ala 


Asp 


Asp 


Val 


Val 


Asn 


Leu 


Met 


Ala 


Met 


Cys 


Pro 


Phe 








255 








260 










265 










GTC 


AGC 


CTG 


ACC 


GAC 


GAC 


GCG 


CAC 


ACG 


CTG 


TCG 


CCG 


TTC 


TGC 


GAC 


CTC 




981 
































25 


Val 


Ser 


Leu 


Thr 


Asp 


Asp 


Ala 


HX8 


Thr 


Leu 


Ser 


Pro 


Phe 


Cys 


Asp 


Leu 






270 










275 










280 












TTC 


ACC 




arc 






ACG 


CAG 


TAC 


AAC 


TAC 


CTG 


CTC 


TCG 


CTG 


GAC 




1029 
























Ser 








Phe 


Thr 


Ala 


Ala 


Glu 


Trp 


Thr 


Gin 


Tyr 


Asn 


Tyr 


Leu 


Leu 


Leu 


Asp 


30 


285 










290 










295 










300 




AAG 


TAC 


TAC 


GGC 


TAC 


GGC 


GGC 


GGC 


AAT 


CCG 


CTG 


GGC 


CCC 


GTG 


CAG 


GGC 




1077 
























val 


Gin 


Gly 




Lya 


Tyr 


Tyr 


Gly 


Tyr 


Gly 


Gly 


Gly 


Asn 


Pro 


Leu 


Gly 


Pro 






305 










310 










315 




35 


GTG 


66C 


TGG 


GCG 


AAC 


GAG 


CTG 


ATC 


GCG 


CGG 


CTG 


ACG 


CGC 


TCC 


CCC 


GTC 




1125 




























val 




Val 


Gly 


Trp 


Ala 


Asn 


Glu 


Leu 


He 


Ala 


Arg 


Leu 


Thr 


Arg 


Ser 


Pro 






320 










325 










330 








CAC 


GAC 


CAC 


ACC 


TGC 


GTC 


AAC 


AAC 


ACC 


CTC 


GAC 


GCC 


AAC 


CCG 


GCC 


ACC 




1173 


























Ala 




40 


His 


Asp 


His 


Thr 


Cys 


Val 


Asn 


Asn 


Thr 


Leu 


Asp 


Ala 


Asn 


Pro 


Thr 




335 








340 










345 










TTC 


CCG 


CTG 


AAC 


GCC 


ACC 


CTC 


TAC 


GCG 


GAC 


TTT 


TCG 


CAC 


GAC 


AGT 


AAC 




1221 
































Phe 


Pro 


Leu 


Asn 


Ala 


Thr 


Leu 


Tyr 


Ala 


Asp 


Phe 


Ser 


His 


Asp 


Ser 


Asn 


46 




350 










355 










360 










CTG 


GTG 


TCG 


ATC 


TTT 


TGG 


GCG 


CTG 


GGT 


CTG 


TAC 


AAC 


GGC 


ACC 


AAG 


CCC 




1269 
































Leu 


Val 


Ser 


He 


Phe 


Trp 


Ala 


Leu 


Gly 


Leu 


Tyr 


Asn 


Gly 


Thr Lys 


Pro 




365 










370 










375 










380 




CTG 


TCG 


CAG 


ACC 


ACC 


GTG 


GAG 


GAT 


ATC 


ACC 


CGG 


ACG 


GAC 


GG6 


TAC 


GCG 


SO 


1317 




























Ala 


Leu 


Ser 


Gin 


Thr 


Thr 


Val 


Glu 


Asp 


lie 


Thr 


Arg 


Thr 


Asp 


Gly Tyr 












385 










390 










395 
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GCC GCC TGG ACG GTG CCG TTT GCC GCC CGC GCC TAG ATC GAG ATG ATG 
1365 

Ala Ala Trp Thr Val Pro Phe Ala Ala Arg ALa Tyr lie Glu Met Met 
400 405 410 

CAG TGT CGC GOG GAG AAG CAG CCG CTG GTG CGC GTG CTG GTC AAC GAG 
1413 

Gin Cys Arg Ala Glu Lys Gin Pro Leu Val Arg Val Leu Val Asn Asp 
415 420 425 

CGT GTC ATG CCG CTG CAC GGC TGC GCG GTG GAT AAT CTG GGC AGG TGT 
1461 

Arg Val Met Pro Leu His Gly Cya Ala Val Asp Asn Leu Gly Arg Cys 
430 435 440 

AAA CGG GAC GAC TTT GTG GAG GGA CTG AGC TTT GCG CGG GCA GGA GGG 
1509 

Lys Arg Asp Asp Phe Val Glu Gly Leu Ser Phe Ala Arg Ala Gly Gly 
445 450 455 460 

AAC TGG GCC GAG TGT TTC TGATGTACAT GCTGTAGTTA GCTTTGAGTC 

1557 

Asn Trp Ala Glu Cys Phe 

465 

CTGAGGTACC 
1567 



(2) INFORMATION FOR SSQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 8: 

Met Gly Val Phe Val Val Leu Leu Ser lie Ala Thr Leu Phe Gly Ser 
15 10 15 

Thr Ser Gly Thr Ala Leu Gly Pro Arg Gly Asn His Ser Asp Cys Thr 

20 25 30 

Ser Val Asp Arg Gly Tyr Gin Cys Phe Pro Glu Leu Ser His Lys Trp 
35 40 45 

Gly Leu Tyr Ala Pro Tyr Phe Ser Leu Gin Asp Glu Ser Pro Phe Pro 
50 55 60 

Leu Asp Val Pro Asp Asp Cys His lie Thr Phe Val Gin Val Leu Ala 
65 70 75 80 

Arg His Gly Ala Arg Ser Pro Thr Asp Ser Lys Thr Lys Ala Tyr Ala 

85 90 95 

Ala Thr lie Ala Ala He Gin Lys Asn Ala Thr Ala Leu Pro Gly Lys 
100 105 110 

Tvr Ala Phe Leu Lys Ser Tyr Asn Tyr Ser Met Gly Ser Glu Asn Leu 
115 120 125 

Asn Pro Phe Gly Arg Asn Gin Leu Gin Asp Leu Gly Ala Gin Phe Tyr 

130 135 140 

Arg Arg Tyr Asp Thr Leu Thr Arg His He Asn Pro Phe Val Arg Ala 
145 150 155 160 

Ala Asp Ser Ser Arg Val His Glu Ser Ala Glu Lys Phe* Val Glu Gly 
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10 



15 



20 



165 170 175 

Phe Gin Asn Ala Arg Gin Gly Asp Pro His Ala Asn Pro His Gin Pro 

180 185 190 

Ser Pro Arg Val Asp Val Val lie Pro Glu Gly Thr Ala Tyr Asn Asn 
195 200 205 

Thr Leu Glu His Ser lie Cys Thr Ala Phe Glu Ala Ser Thr Val Gly 
210 215 220 

Asp Ala Ala Ala Asp Asn Phe Thr Ala Val Phe Ala Pro Ala lie Ala 
225 230 235 240 

Lys Arg Leu Glu Ala Asp Leu Pro Gly Val Gin Leu Ser Ala Asp Asp 

245 250 255 

Val Val Asn Leu Met Ala Met Cys Pro Phe Glu Thr Val Ser Leu Thr 

260 265 270 

Asp Asp Ala His Thr Leu Ser Pro Phe Cys Asp Leu Phe Thr Ala Ala 
275 280 285 

Glu Trp Thr Gin Tyr Asn Tyr Leu Leu Ser Leu Asp Lys Tyr Tyr Gly 
290 295 300 

25 Tyr Gly Gly Gly Asn Pro Leu Gly Pro Val Gin Gly Val Gly Trp Ala 

305 310 315 320 

Asn Glu Leu lie Ala Arg Leu Thr Arg Ser Pro Val His Asp His Thr 

325 330 335 

30 Cys Val Asn Asn Thr Leu Asp Ala Asn Pro Ala Thr Phe Pro Leu Asn 

340 345 350 

Ala Thr Leu Tyr Ala Asp Phe Ser His Asp Ser Asn Leu Val Ser lie 
355 360 365 

35 Phe Trp Ala Leu Gly Leu Tyr Asn Gly Thr Lys Pro Leu Ser Gin Thr 

370 375 380 

Thr Val Glu Asp lie Thr Arg Thr Asp Gly Tyr Ala Ala Ala Trp Thr 
385 390 395 400 

40 Val Pro Phe Ala Ala Arg Ala Tyr He Qlu Met Met Gin Cys Arg Ala 

405 410 415 

Glu Lys Gin Pro Leu Val Arg Val Leu Val Asn Asp Arg Val Met Pro 

420 425 430 

45 Leu His Gly Cys Ala Val Asp Asn Leu Gly Arg Cys Lys Arg Asp Asp 

435 440 445 

Phe Val Glu Gly Leu Ser Phe Ala Arg Ala Gly Gly Asn Trp Ala Glu 
450 455 460 

Cys Phe 
465 



55 



Claims 

1 . A process for the production of a modified phytase with improved activity properties characterized therein that the 
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following steps are effected: 

a) the three dimensional structure of the phytase to be modified and. optionally of another phytase with activity 
properties which are more favorable than the ones of the phytase to be modified is/are computer modelled on 

5 the basis of the three dimensional structure of the phytase of Aspergillus niger: 

b) the structure of the active sites of the phytase to be nradlfled and of the phytase with the more favorak)le 
activity properties are compared and those amino acid residues in both active sites which are different are 
Identified; 

10 

c) a DNA sequence coding for a modified phytase Is constructed by changing the nucleotides coding for at 
least one of the amino acicte by which txrth active sites differ; 

d) integrating such a DNA sequence into a vector capable of expression in a suitable host cell; 

IS 

e) transfornting a suitable host cell by the DNA sequence of c) or the vector of d), growing said host cell under 
suitable growth conditions and isolating the modified phytase from the host cell or the culture medium by meth- 
ods known in the state of the art. 

20 2. A process as claimed in claim 1 wherein the phytase to be modified is of eukaryotic origin. 

3. A process as claimed in daim 1 or 2 wherein the phytase to be modified is of fungal origin. 

4. A process as daimed in any one of daims 1 to 3 wherein the phytase to be modified is of Aspergillus origin. 

25 

5. A process as daimed in any one of daims 1 to 4 wherein the phytase to be modified is a phytase from Aspergillus 
fumigatus. 

6. A process as claimed in any one of claims 1 to 5 wherein the phytase with the more favorable activity properties is 
da of eukaryotic origin. 

7. A process as daimed In any one of claim 1 to 6 wherein the phytase with the more favorable activity properties is 
of fungal origin. 

35 8. A process darned In any one of daims 1 to 7 wherein the phytase with the more favorable activity properties is of 
Aspergillus origin. 

9. A process as claimed In any one of claims 1 to 8 wherein the phytase with the more favorable activity properties is 
a phytase from Aspergillus ten-eus. 

40 

10. A process as daimed in any one of claims 1 to 9 wherein the phytase to be modified is a phytase of Aspergillus 
fumigatus and the phytase with the more favorable activHy properties is the Aspergillus niger phytase. 

11. A process as daimed in any one of claims 1 to 9 wherein the phytase to be modified is a phytase of Aspergillus 
45 fumigatus and the phytase with the more favorable activity properties is the Aspergillus terreus phytase. 

12. A modified phytase obtained or obtainable by a process as claimed in any one of daims 1 to 11 . 

1 3. A phytase which has been modified in a way that its activity property is more favorable than the one of the non-mod- 
so ified phytase. 

14. A phytase according to daim 13 characterized therein that the amino add s^uence of the non-nxxiified phytase 
has been changed deletion, substitution and/or addition tiy one or more amino acids. 

55 15. A phytase according to any one of claims 13 or 14 wherein changes have been made at at least one position which 
is homologous to one of the fbllowing positions of tiie amino acid sequence of the phytase of Aspergillue niger: 27, 
66, 71. 103. 140, 141. 188, 205, 234, 235, 238, 274, 277. 282. 340 and/or 424, preferably 27, 66. 140. 205. 274, 
277. 282 and/or 340. 
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16. A phytase aocoirilng to any one of claims 13 to 15 which is the phytase of Aspergillus, preferably Aspergillus fumi- 
gatus. 

1 7. A phytase according to any one of clainfis 1 5 or 1 6 wherein at position 27 or at least at position 27 a change occurs. 

5 

1 8. A phytase according to claim 1 7 wherein the amino acid at position 27 is replaced by one selected from one of the 
Ibllowing groips: 

a) Ala, Val. Leu» lie; or 

10 

b) Thr. 

19. A phytase according to claim 17 or 18 wherein in ^ition to position 27 a change occurs also at position 66. 
15 20. A phytase according to claim 17 or 18 wherein in addition to position 27 a change occurs also at position 140. 

21 . A phytase according to any one of daims 1 3 to 20 which is characterized by at least one of the Ibllowing mutations: 
Q27U Q27N. Q27T. Q27I. Q27V, Q27A. Q27Q. S66D. S140Y, D141G. A205E. Q274U G277D. Q277K. Y282H 

and/or N340S. 

so 

22. A DNA sequence comprising a DMA sequence coding for a phytase as claimed in any one of claims 12 to 21 . 

23. A vector comprising the DNA sequence of claim 22. 
25 24. The vector of claim 23 which is an expression vector. 

25. A host cell which has been transionned by a DNA sequence as claimed in claim 22 or the vector of claim 23 or 24. 

26. A process for the preparation of a phytase as claimed in any one of claims 12 to 21 wherein the host cell as claimed 
30 in daim 25 is cultured under suitable culture conditions and the phytase is isolated from such host cells or the cul- 
ture medium by methods known in the art . 

27. A flood or feed composition comprising a phytase as claimed in any one of claims 12 to 21 . 

35 
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yiwire 4/1 



L TCTGTAACCCATAGCGGACCGACrAGCC VLCG L rGA TCCACAATATCrCA 

• • • • • 

SI GACAATC^IAACrCACrrCGAATATGAAC^ 

• • • • • 
10 1 GGCCGTCTAGGTCGCCCTCCCCGGATGAGGACGACCAGGCTCG'ra 

• • • • - 

MAFFTVALSLYVtLS 
. • • • • 

2Q L CGAGgcgagacccccacaacacccgcccgcccagccgaaccggcacct ac 
R 



• • • • • 

25 1 ccgcacagAGTCrcrCCTCACGCCCCAGT^ 300 

VSAQAPVVQMKSCM 30 



» • • • • 

30 L T^COSCCCKCCrtQ^^ 3 SO 

TAOGCrQCFPNVSKVWG 47 

• • • • • 

331 CTCAGTACTCGCCGTA C 1 IClCC ATCGACCAGGACTCAGCTATCrcrGAG 400 

QY5PYFSZSQSSAZSE €3 

• • • • • 

40L CACGTGCCrCAl Uj ClUiliAOCTrA LLi i ' lG I G CAGGTCCTCTCGCGGCA 450 

DVPKGCEVTFVQVZ.SRK 80 



• • • • • 

451 TGC-GGCTACGTATCCGACAGACTCGAAGAGT^^ SOO 
CARYPTESKS XAVSCLI 97 



• • « • " 

SOI TrGAACCAATCaVCAAGAATGCTACCi'Cr ill' i GCGCACACTAiU- i i' 111 550 
EAZQKITATSFWGQYAF 1X3 



m • m • • " 

S 3 1 CrGGAGACrrATAACtATACCCTCGGCGCGGATGACXTt^^ 600 
LESrNYTLGAOOLTIFG 130 



« • • ■ » 

€01 CGAGAACCAGATGGTTGATTCGGaTGCrAAGTrCTACCGACaCT^ 650 
EfcTQMVDSGAKFVRRYKM 147 



• • • " - 

631 ATCTCGCCaCGAAAAATA L I ' I ' I ' l ATCrGTCCATa^jGCTCTGACCGT 700 
LARKNTPFIRASGSOR 163 



SO 

100 

ISO 

200 
IS 

2S0 
16 
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Fitmre 4/2 



• • • • • 

701 GTCGrrGCGTCTGCGGACAAGrrCAITAAIGGAm 7 SO 

VVASASKFINGFRKAQL 130 

• • • • « 

75 L CCACGACCATGSCtCCAMCXTGCTACGCC^^ 800 

KDKGSKRATPVVNVrtP 197 



a • • • • 

SOL CTGAAATCGATCGGTrrAACAACACCCTGGACCATAC^ 8S0 
EIDGFNMTLOKSTCVS Z13 



• • • • • 

a S 1 TTTGAGAATGATGACCGQQCGGAtGAAATTGAASCCAATrTC^^ 900 
FEeiOCRA. OEXEANFTAX 230 



■ • • • • 

901 TATGGGACCTCCGATCCGCAAAarrcTQGAAAATGACC^ 950 
MGPPIRKRLENOLPGZK 247 



• • • • • 

95 1 AACTTACMUU^GACAATGTAATATATriGATGGA^ lOOO 
LTNSNVXVLMOMCS ?D 263 



• • • • • 

1001 xrr^Trsc^^^Arrrj^xrr^.^rrr.uTTcrr^^ lOSO 

TMARTAKGTELSPFCAI 280 



• • • • • 

lOSi rrrgAcrciwiAJUXSAgrgXKr^^ IIQO 

FTeRBW'LQYOYLQSLSK 297 



1101 ACTACT ACGGCTACGGTGCCGGAAGCCCCCTTGGCCCAGCTC^ llSO 
YYGYGAGS PLGPAQGZ 313 



1131 GGCTTCACCAACGAGCTGATTGCCCGACrAACGCAATCGCCC^ 1200 
GFTMBL IARLTQSPVQO 330 



1201 CAACACAAGCACCAACCACACrCTAGACTCGAACC LLGC 1250 

NTSTNHTL 0 SNPAT F P L 347 



• • • • • 

12S1 TCGACAG G AaCCTCTACGCCGACri' C T C CCACGACAATA^ 1300 

0RKLYA9FSKDef.SM^£S 3$3 

• • • * • • 

1301 ATATTCITCGCCATG GG T C TTTACAACGCCACCCAGCCGCTG^ 1350 
IFFAMGLYMGTQPIiSMO 330 
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FTtnire 4/3 

» • • • • 



13SL 


rrCCCTGGACTCCMrCAGGACATGGACGGTrACCkZGGCGrc^^ 
SVESIQEMDQYAASWTV 


140Q 
397 


L4QI 


• • • • a 

TTCCGTrrCGTGCGAGC^KrrrACITTGAGCT^ 

PFGARAYFELMQCEKK 


14SQ 
413 


I4SI 


♦ • • • • 
GAGCCGCllOlGCGGGTATrAGTGAATGATC^ iLLiCi'LCATGC 
EPLVRVLVMDRVVPLKG 


1500 
430 


ISOL 


• « • • • 

CTGCGCAgrTGACAAU ll'iljGACGGTGCACl'l loJ.GACGArrGGCTAGACG 
CAVO KFG R CTL D DWV E G 


1S50 
447 


X5SL 


« • • • • 

LKPARSGGKMKTCPTL 


WW 

463 


X€Ol 


• • • • • 

TAAACgGCUi.X'IGClCAITCATA>VGlGi^GrGCAGCrATAGG 


1650 


1651 


• • • • • 

GGAATTACCIGTriGGCrrrACTCTrArrAGACCA^^ 


1700 


1701 


• • • • • 

TrarcAACGCcrrcrACCATATorrcAA^^ 


1750 


1731 


• • • • • 

CATCrGTACGIXiAACCCGClCITGCATCAALC iLi'iU iXslVl UAGACTAC 


1800 


1801 


• • • • • 
TrrCACCAAACATATCCra/KTrCClX^^ 


1850 


18S1 


• • • • " 

TACAClti'taXiaCrATCTATATCGTCAACAAAACTACCACCCAAACACCAA 


1900 


1901 


• • • 
ATGTCACACil'iCCACCACGAAArnt.Tl'Ca 19 J 1 
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■ • • > ' 

1 TTccAcarrcAAACCCTCAcrGGcuTTTC^^ so 

• » • • • 

SI AACTGCCTGCITAICrTCXTCAGAaX^ 100 

• • • • • ' 
101 TGC\CCCATGACGGACGAACGCACCG CiCiL4 tU jCCTCCACGGACCCGG ISO 

( • • • ■ 

ISL AGGTCGACX;CXGATGAC<rrCCCGCCCTCGACGGC^ 200 

a a • • * 

201 C:AGTrG;wSATC7CGCrGCGAACCTCGACCGa^ 2S0 

• • • • * 

231 OU ' l CC r C G CCTTCC UtfSGAAGa AaU^LGC T GrG ACGATGAUlUl^ 300 

K S L L L 5 



• • • " " 

. 301 TGJ L GG ' lG Cr G TCCGGCGG O L ' lUi ILGCG I i ' A .TAgcacgccceccecccc .350 
LVLSGGLVALY 16 



• • • • » 

3 S I cggteacaccgccccccgccaacgcccccacaaccgaagTGTCrCAAGAA 400 

V S R H 20 



401 ATCCGCATCTTGATACCCACTCTTOCAATAC^^ 450 

PKVOSKSCNTVECGYQ 3S 

• • • • • 

4S1 TGTOCTCCACAAATCTCCCACTCCreGGGCCAGTAT^ SOO 

CRPEISKSWGQYSPFFS S3 



• • • ■ • » 

SOI CCTGGCACACCACTCGGACATCrCGCCAGATo^ 530 

LAOQSEISPDVPQNCKr 70 

• • ' • • • 

SSI TTAC G it. - iG r C CAG L - lWlUlCiaSl ' CA CGCCGCTAGATA^ 600 

TFVQLLSRHGARYPTS 86 

• • • • • 

601 TCCAAGACCGAKrrCTATTCGCAGCTCATC^^ 650 

SK TEL YSQLISa rQKTA 103 



• • • ' ' 

651 CACTGCCTACAAAGGCTACTATGCCTTCTTGAAACACTACA^^ 700 
TAYKGYYAFLKOYR'YQL 120 



• • • - 

701 TGGGAGCCAACCACCrGACGCCCITrGGGGAAAACC^ 750 
GAMDLTPFCSNQMIQt- 136 
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Figure 5/2 



7SX GGavrCAA C il ' Vi A rAACCATTACAAGACTCTCGCCACGAATGCCGTCCC 800 
CIKFYNKYKSLARtrAVP 1S3 



801 Arr ai 'rr CG iT G CrCCGGCTCTGATCGGGTCArTCCCT 830 
FVRCSGSDRVIASGRLF 170 



• • • « 

8 S I TCATCGAAl Ujil iXC ACAGCGCCAAAGZ GCTC< ;ATCCTCATTCAGAC^ 300 
£EG?QSAKVL09KSDK 186 



• • • • " 

901 CATGACGCTCCTCCCACGATCAACGTGATCATCGAGGACG^ SSQ 
KDAPPTtNVrXEEGPSY 203 



• • • • • 

9S 1 CAATAACACGCTCGACACCGGCACCTGTCCA b it i i iG AGGACAGCAGCG 1000 

MHTLDTGSCPVPEDSSG 220 

• • • • • 

100 1 GGGGACATXIACCCACACGAAAACrrCGCAAACCAATrCCC^^ 1050 

GKDAQEKFAKQFAPAI 2l€ 



• • • • • 

1051 CTGGAAAAGATCAACGACCATCTTCCCXGCGTGGACCTGGCCCT^^ 1100 
LEKXKOKLPGVOLAVSO 2S3 



1101 TrrrArrrrrArTTr^TOGA i^L iULUIi l ^cai l LUAGACCTrGGCTCGCAACC IISO 
•VPYLMDLCPFETLARNK 270 



1151 ftrArjusACACt^iGLCTCC^rrcrcx: ^ i Lr rArr^AACACCAGTCG i20a 

T 0 T L S P ? C A L S T Q B £ M 286 



• • • • « 

1201 rAACgATATGA CTACTACCAAACTCTtX^AAATACTATGGCAAIC^ 12S0 

QAYDYYQSLCKVYGtrCC 303 

• « • • ■ 

1251 GGCTAACCCCTTGGGGCCACCCCAAGGCGTGGQ j I ' iTGi CAACGACTTaA^ 1300 

GBTPLGPAQGVG FVtrS L £ 320 



♦ • • • • 

13CI TTGCTCCCATCACCCATACCCCrorCC^ 1350 
ARMTKSPVQDYXTVJIK 336 



• • ■ ' 

1351 ACTCrTGACrCGAATCCGGCCACATTCC L i i tU AACGCGACGCTGTACGC 1400 
TLDSNPATFPLNATLYA 333 
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Figure 5/3 



1401 AGATTTCACCCACGACAACACAATGACGTCA A ' l Ufl ' CG CGGCCTTGGGCC 1450 
OFSKQUTMTSIFAALGL 270 



US I TGTACAACGGGACCGCGAACCIGTCCACGACCGAGATCAACTC^ IS 00 

YHCTAKLSTTcIKSIE 339 

■ 

« • « • « 

LSOl GAGACGCUrGGCTACrCGGCCCCGTGGACCCrrcaTrTCGGGGGCCGA^ 1530 

ETOGVSAAWTVPFGGRA 403 

• ■ • • ■ 

I5SL CTATATCGAGATGATGCAGTGTGATGATrCGGATGAGCCAC^ 1^00 

YISMMQCDOSOEPVVRV 420 



1601 TGCTGGICAACGACCGGGriGGTGCXACrGCAl^^ 16 30 

tVMORVVPLKGCEVDS 436 



16SI CTGGGGCGATGCAAAOGACSACGA tll lU r C AGGGGACTCA Lf ITliliC GCG 1700 
LGRCXRODPVACLSFAR 4S3 



• • • • • 

1701 ACAGCGTCGGAACIGGGAGGG G I ' G llA CG C I XSC rX t U AGTAgGTTTATT 17S0 



QCGfilWEGCYAASE- 466 

• • ■ • • 

17S1 cMXSACtrrccACCTrrcrATccTT^^ laoo 

• • • • 
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FitnirA fi/1 



• • » ■ • 

I AGATTCAACGACCKSACX^AATCCCA^ 50 

M V T 3 



SI TCTGACirrCCTGCnTCGGCGGCCTAT C I U- ' I ' i ' 1 C XG Ggcgagcsgc c c 100 
LTcLLSAAYLLSG 16 



• • * • * 

IQ I ggacccaccgcccggacagggccgcggcgccgaccccgaaacggagTAGA ISO 

R 17 



• • • • • • 

ISl GTGTCT(KXXK:ACCTAG'ria' GC CGGCrCCAA G T CC T G CGATACGGTAGA. 200 
VSAAP5SAGSKSCDTV0 34 



• • • • • 

20L CCTGGGGTACCAGTGCrCCCCTGCGACITCrCATCrATC^ 250 

LGYQCSPATSKLWGQYS 51 

• ■ • • • 

2 S I CGCCATT Li A> I ICG CTCXSAGGACGAGCTGTC CG TCT CG ACTAACCTrCCC 300 

PFFSLEOELSVSSKLP 67 

• • « « • 

301 AAOGATTGGCGGATCAOCTIGGTACAOGTGCTATaSCGCCATGa 350 

iCOCRZTLVQVLSRKGAR 64 



• • • • • 

331 G7ACCCAACCACCirCAACACCAAAAACTATAAGAACCllUlX«ACGGC^ 400 
VPTSSfCSKKYKKLVTAX 101 



• • • • • 

401 TCCAGGCCAATGCCACCGACTTCAAGGGCAAUL I LU.LI L i i iUAAGACG 450 
QANATDPKGfCFAFLfCT 117 



• « • ■ • 

431 TACAACTATACrCTGCXntXaSCATGACCTC^ SOO 

YKYTL GAO 0 LTP'PG £ Q Q 134 

• • • • • 

501 GCTGG TC AACTaXXXATCAACITCrACCAC^^ 530 

LVNSG IKcYQRYICALAR ISI 



• • • t • 

331 GCAGTCTGCTGCCGTlTATTCGCGCCrCACGCTC;^^ 600 
SVV9 PIRASGSORVIA 1^7 
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Figure 6/2 



• • ■ • ■ 

601 TCGGGAGACAAGTTCATCCACGGGTrCCAGCAO^ 630 

SGEKFreCcQQAKLAOP 134 

• • • • • 

6 S I TGGCGCCACGAACCKCXXSCTCCGGCGATTAGT^^ 700 

GATNRAAPAISVtlPSS 20L 



• • • • • 

701 GCG?>a?X:GT:CJ^ACAATACCX^^ 750 
ETFNNTLDKGVCTKFE 21.7 



• • * • • 

751 GCGAGTCAGCTGGgACATCACGTTGCGGC^^ 800 

ASQLGOEVAANFTALfA 234 

• • • • • 

801 ACCCGACATCCGAGCTCGCGCCGACAACCATC V ICCX G GCGTGACGCTGA 850 

POIRARAeKKLPCVTLT 251 



■ • • ■ • 

851 CAGACX?ACGA CGn ' GTC ACTCTAATGGAC A:G CGITCG Ll ' tG A XACGGTA 900 
OEOVVSLMOMCSFOTV 2S7 



• • » • « 

90X GCGCGgA<:rAgX!GACGCAACTCAG(ritrrC^ 950 

ARTSOASQLSPFCQLPT 284 

• • • • • 

9SI TCAgAATgagrmAAfaAffrarAACTACgTria^GTCCTO^ 1000 

KKZWKKr etYl.QSLCKYY 301 

• • • • • « 

lOOI ACGGCTACGGCCCACGCAACCCTCTGGGACCGGC^^ 1030 

GYGAGKPLGPAQGIGF 3L7 



• ■ ■ • • 

1051 ACCAACGACCIOATIGCCGGGTTGACTCGrCCGCCA^ IIQO 
TArELIARLTRSPVQDKT 334 



• « • • • 

110 L CAGCACTAACirCACTCTAGTCrCCAACCCGGCCACCTTCC ^ l"l - G AACG 1150 

ST&rSTLVSNPATFPLtfA 351 

• • • > • 

1151 CrACCATGtACGTOGA C i> i I iLA CACCACAACACCATG G i I^iXJA TCTTC 1200 

TMVVOFSKOMSMVS IF 337 



1201 TTTCCATTGGGCCTGTACAACGGCACrCA^^ 1230 
FA&GCVtrCTEPLSRTSV 384 
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. • • • ' 

12S1 CGAAACCCKrCAAGGAATTGGATGGGT^^ 1300 
ESAKELDGYSASWVVPF 401 



1301 TCGGCGCCCGACCCTACTTCCACACGAIGCA^ 13 SO 

GARAYFETMQCKSESCB 417 



. • • • - 

13 S X CCTCTTCrrrCGCGCTTTOVrrAATG^ 1400 
PLVRALINORVVPLKGC 434 



• • • • . • 

1401 CCacrGTGC»CAACCraX5aCGATOCAAC^^ 1*30 
OVOKLGRCKCHOFVKOL 4S1 



• • • • - 

14 SI TGACTTCGGCCAC»ATCrQQGQGC»ACTG^ LAGTXOAGAT LSOO 

SWARSGGKWGECFS* 465 



ISO I GTCATtGmTCCtATACTCCAATACACajriU. i I AGCCATTCACtTCA 1550 
XSSI Cn ' ICC rC B AACCgCCI GC OC ^^f^ 
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Figure 7/1 



* • • • • 

• • • • • 

SL AAGAATATACrGi C l'CriViGGCTACCATCXXj ^ 100 

MGVFVVLL a 



101 ATCTATCSCGACTCtGTTOSuCAGgcacgcgcaccgccccaggcccaacc ISO 
SXATLFGS XS 



• • • ■ • 

LSI cgcccgccaaccgacaaacagcacagCACATCCGGCACTCCGCTGCGCCC 200 

TSGTALGP 24 



• • • ■ • 

201 CCGTGGMATCACACaSACTGCACCrCM^^ 2S0 
RCKrKSOCTSVORGyQCP 41 



• • • • • 

2S1 TCCCTGAGCrCTCCCATAAATCGGGTCTCTAayiGC 300 
PELSKKWGLITAPYFSL 57 



• • • • • 

301 CAGGATCA A r C C CC Gl I ICCILI UG ACGTCCCGGATCACTGCCACATCAC 350 

QOESPFPLOVPOOCKIT 74 

• • • • • 

35 1 C f I IG I ' C CACCn G CT GC CCCCACATGGAGQGC GG TCrCCAACC^ 400 

FVQVLARKGARSPTDSK 91 



• • • • • 

40L ACACAAAGSOCTATCCCGCCACTArCGCAGGCATCC^ 4S0 
TKAVAATIAAiaXMAT 107 



45 L G CUl ' iU CCGGGCAAATACGC Cl ILLlU AASTCGTACAATrACTCCATGGG 500 
ALFGKYAFLKSYNYSMG 124 



• • • • • 

SOI CrCCGAGAACCTGAAC CC C t I XG GGaySAACCAACrGCAAGATCTGGCCG 550 
SENLNPFGRMQLQOLGA 141 



• ■ • • • 

SSI CCCACrrCTACarrCCCTACrACACCrTCACCCGG^ 600 

QFYRR YOTLTRKXN ? F 137 

• • • ■ • 

€01 GTCCGGGCCGCGGArrCCTCOIGCGTCCACGAATCAGCCGACAA^ 630 

VRAAOSSRVKSSAEKFV 174 



47 



EP0 897010A2 



FitriiT^ 7/2 



• • • • " 

651 CXIACGCKnTCCAAAACaXCGCCAAMCGATCCrC^ 700 
EGFQKARQGOPKAtrPKQ ISL 



• • • • • 

701 AGCCGTCCCCGCGCCrGGAICTACTCATCCCCGAAGGCACCGCCTAC^ 750 

PSPRVOVVIPEGTAYN 207 

• • • • 

75 1 AACACGCTCGAGCACACCATCTGC^CGCCrrCGAGG^ 800 

NTLEKS ICTAFEAS TVG 224 



• • ■ ■ " 

8 0 L CGACGCCGCGGCAGACAACTTCACrGC CUl ' G l rCGCGCCGGCGATCGCCA 850 
OAAAONFTAVFAPAtAK 241 



• • • " ' 

851 AGCCTCTGGJ^GGCa^TCTCCCaXXg^ 900 
RLEAOLPGVQLSAOOV 257 



• • • • • 

901 rrrrx:^Tmxrrr:rrxrr7TrTrcfrT^ 950 
V^^LMAMCPFETVSt.TDO 274 



• • » • • 

AKTLSPFCOLFTAASffT 291 

- . • • " 

QVMVLLSLOKVyCYGG 307 



• • • • ' 

IQSl GGCA Al LC G CT GGG CQCrGTGCAOG CXJGaXX XXlij G^ LIOQ 
GerPLGPV QGVGt^ANS L Z J24 



• • • ♦ • 

llOl CGOCK:a GC I GA CGCCCTCCCCCGTCCACGACCACACCTG<CTCAACAA^ 1150 
ARLTRSPVKOKT CVNNT 34t 



• • • • ■ 

1151 CCCrCGAOGCCAACrCOGCCA CCL ' rC CCGCrCAAOGCrAC 1200 

L OAKPATFPLKATt VA ' 3S7 

. • • • • 

1201 r.X( ILL rg CCAgC ACAGTAACCTOTTC^ I'l^' DXC^GCTGGCrLCT 1250 

OFSKOSNLVSIrWALGL 374 
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Figure 7A1 



I2SI <:rrACAACSGCACC^.(KrCCCTCrCGCACACC:ACg^^^ 1300 
^NGTKPLSQTTVEOrTR 39L 
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Figure 8/a 
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Figure 8/3 
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Figure 15: 
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Figure 17a: 
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Figure 17b: 
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Figure 19: 
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Figure 26: 
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